METHODS AND COMPOSITIONS COMPRISING RENILLA GFP

This is a continuing application of U.S.S.N. 60/164,592, filed November 10, 1999, hereby expressly
incorporated by reference.

FIELD OF THE INVENTION 

The invention relates to methods and compositions utilizing Renilla green fluorescent proteins (GFP).
In particular, the invention relates to the use of Renilla GFP proteins as reporters for cell assays, 
particularly intracellular assays, including methods of screening libraries using GFP. 

BACKGROUND OF THE INVENTION 

The field of biomolecule screening for biologically and therapeutically relevant compounds is rapidly
growing. Relevant biomolecules that have been the focus of such screening include chemical
libraries, nucleic acid libraries and peptide libraries, in search of molecules that either inhibit or
augment the biological activity of identified target molecules. With particular regard to peptide
libraries, the isolation of peptide inhibitors of targets and the identification of formal binding partners of
targets has been a key focus. However, one particular problem with peptide libraries is the difficulty of
assessing whether any particular peptide has been expressed, and at what level, prior to determining
whether the peptide has a biological effect.

The green fluorescent protein from Aequorea victoria (termed herein "aGFP") is a 238 amino acid
protein. The crystal structure of the protein and of several point mutants has been solved (Ormo et
al., Science 273, 1392-5, 1996; Yang et al., Nature Biotechnol. 14, 1246-51, 1996). The fluorophore,
consisting of a modified tripeptide, is buried inside a relatively rigid beta-can structure, where it is
almost completely protected from solvent access. The fluorescence of this protein is sensitive to a
number of point mutations (Phillips, G.N., Curr. Opin. Struct. Biol. 7, 821-27, 1997). The fluorescence
appears to be a sensitive indication of the preservation of the native structure of the protein, since any
disruption of the structure allowing solvent access to the fluorophoric tripeptide will quench the
fluorescence.

A GFP from Renilla mulleri (termed herein "rGFP") has been reported recently; see WO 99/49019,
hereby expressly incorporated by reference.

It is an object of the present invention to provide methods and compositions comprising rGFP.

SUMMARY 

In accordance with the above objects, the present invention provides retroviral vectors comprising a p- 
or rGFP gene. These vectors can further comprise a first gene, an IRES site, and the p- or rGFP
gene. 

In an additional aspect, the invention provides libraries of fusion nucleic acids, each fusion nucleic acid 
comprising a gene encoding a random peptide; and a gene encoding a p- or rGFP; the fusion nucleic 
acids can further comprise fusion partners. 

In a further aspect, the present invention provides libraries of retroviral vectors comprising a library of 
fusion nucleic acids, each fusion nucleic acid comprising a gene encoding a random peptide; and a 
gene encoding a p- or rGFP. 

In an additional aspect, the invention provides methods of screening for bioactive agents capable of 
modulating the activity of a promoter of interest. The methods comprise combining a candidate
bioactive agent and a cell comprising a fusion nucleic acid comprising a promoter of interest; and a
nucleic acid encoding a p- or rGFP protein. The promoter may be optionally induced, and then the 
presence of the p- or rGFP protein is detected. 

DETAILED DESCRIPTION OF THE FIGURES 

Figure 1 depicts a homology lineup between the GFPs of Renilla mulleri, Ptilosarcus gurneyi,
Aequorea and its enhanced version, EGFP. The underlined residues are the fluorescent tripeptide
(chromophore). Identity, strong similarity and weak similarity are depicted.

Figure 2 depicts the nucleic acid sequence of the wild type rGFP. 

Figure 3 depicts the nucleic acid sequence of the wild type pGFP. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the use of Renilla green fluorescent protein (hereinafter "rGFP") in
a variety of methods and compositions that exploit the autofluorescent properties of rGFP. These
methods include, but are not limited to, the use of rGFP as a reporter molecule in cell screening
assays, including intracellular assays; the use of rGFP as a scaffold protein for fusions with random
peptide libraries; etc. Similarly, compositions of rGFP are provided, including constructs of rGFP such
as fusion constructs that include rGFP as a reporter gene, retroviral constructs including rGFP and
internal ribosome entry sites (IRES), etc. Basically, the invention provides a number of novel uses for
rGFP, similar to those outlined for aGFP in WO 95/07463, hereby incorporated by reference in its
entirety. In addition, the invention is also directed to the use of Ptilosarcus gurneyi green fluorescent
protein ("pGFP"), the amino acid sequence of which is shown in Figure 1 and is also depicted in WO
99/49019. It should be noted that while the discussion below is directed to rGFP, pGFP may be used
as well.

In a preferred embodiment, the invention provides compositions including rGFP. By "Renilla green
fluorescent protein" or "rGFP" herein is meant a protein that has significant homology, as defined
herein, to the wild-type protein of Figure 1, as depicted in WO 99/49019, hereby incorporated by
reference in its entirety.

In a preferred embodiment, the invention provides compositions including pGFP. By "Ptilosarcus green
fluorescent protein" or "pGFP" herein is meant a protein that has significant homology, as defined
herein, to the wild-type protein of Figure 3, as depicted in WO 99/49019, hereby incorporated by
reference in its entirety.

An rGFP or pGFP protein of the present invention may be identified in several ways. "Protein" in this
sense includes proteins, polypeptides, and peptides. An r- or pGFP nucleic acid or protein is initially
identified by substantial nucleic acid and/or amino acid sequence homology to the sequences shown
in Figures 1, 2 and 3. Such homology can be based upon the overall nucleic acid or amino acid
sequence.

As used herein, a protein is an "rGFP protein" or "pGFP" if the overall homology of the protein
sequence to the amino acid sequence shown in Figures 2 or 3 is preferably greater than about 75%,
more preferably greater than about 80%, even more preferably greater than about 85% and most
preferably greater than 90%. In some embodiments the homology will be as high as about 93 to 95 or
98%.

Homology in this context means sequence similarity or identity, with identity being preferred. This
homology will be determined using standard techniques known in the art, including, but not limited to,
the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology
alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity
method of Pearson & Lipman, Proc. Natl. Acad. Sci. U.S.A. 85:2444 (1988), by computerized
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin
Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, WI), or the Best
Fit sequence program described by Devereux et al., Nucl. Acid Res. 12:387-95 (1984), preferably
using the default settings, or by inspection.

In a preferred embodiment, similarity is calculated by FastDB based upon the following parameters:
mismatch penalty of 1.0; gap size penalty of 0.33; joining penalty of 30.0 ("Current Methods in
Comparison and Analysis", Macromolecule Sequencing and Synthesis, Selected Methods and
Applications, pp. 127-149 (1998), Alan R. Liss, Inc.). Another example of a useful algorithm is
PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using
progressive, pairwise alignments. It can also plot a tree showing the clustering relationships used to
create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng and
Doolittle, J. Mol. Evol. 35:351-60 (1987); the method is similar to that described by Higgins and Sharp,
CABIOS 5:151-3 (1989). Useful PILEUP parameters include a default gap weight of 3.00, a default
gap length weight of 0.10, and weighted end gaps.

An additional example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J. Mol.
Biol. 215:403-410 (1990) and Karlin et al., Proc. Natl. Acad. Sci. U.S.A. 90:5873-87 (1993). A
particularly useful BLAST program is the WU-BLAST-2 program, which was obtained from Altschul et
al., Methods in Enzymology 266:460-480 (1996); http://blast.wustl.edu/blast/README.html.
WU-BLAST-2 uses several search parameters, most of which are set to the default values. The adjustable
parameters are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold
(T) = 11. The HSP S and HSP S2 parameters are dynamic values and are established by the
program itself depending upon the composition of the particular sequence and composition of the
particular database against which the sequence of interest is being searched; however, the values
may be adjusted to increase sensitivity. A % amino acid sequence identity value is determined by the
number of matching identical residues divided by the total number of residues of the "longer"
sequence in the aligned region. The "longer" sequence is the one having the most actual residues in
the aligned region (gaps introduced by WU-BLAST-2 to maximize the alignment score are ignored).
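
The identity calculation described above can be illustrated with a short sketch. It assumes the two sequences have already been aligned by some program and are supplied as equal-length strings in which "-" marks a gap; the function name percent_identity and the gap character are illustrative assumptions and are not part of WU-BLAST-2.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over an aligned region.

    Matching identical residues are divided by the residue count of the
    "longer" sequence, i.e. the one with more actual (non-gap) residues
    in the aligned region. Gap characters are ignored when counting.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    # A column counts as a match only if both residues are identical
    # and neither position is a gap.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )

    # Count actual residues in each sequence, ignoring introduced gaps.
    residues_a = sum(1 for x in aligned_a if x != gap)
    residues_b = sum(1 for y in aligned_b if y != gap)
    longer = max(residues_a, residues_b)
    if longer == 0:
        raise ValueError("aligned region contains no residues")

    return 100.0 * matches / longer
```

For instance, a candidate that aligns to four of six reference residues with no mismatches would score about 66.7% under this definition, because the denominator is the six residues of the longer sequence.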

In a similar manner, "percent (%) nucleic acid sequence identity" with respect to the coding sequence
of the polypeptides identified herein is defined as the percentage of nucleotide residues in a candidate
sequence that are identical with the nucleotide residues in the coding sequence of the rGFP proteins
(see Figure 1). A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default
parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively.

An additional useful algorithm is gapped BLAST as reported by Altschul et al., Nucl. Acid Res.
25:3389-3402 (1997). Gapped BLAST uses BLOSUM-62 substitution scores; threshold T parameter
set to 9; the two-hit method to trigger ungapped extensions; charges gap lengths of k a cost of 10+k;
X_u set to 16, and X_g set to 40 for database search stage and to 67 for the output stage of the
algorithms. Gapped alignments are triggered by a score corresponding to ~22 bits.

The alignment may include the introduction of gaps in the sequences to be aligned. In addition, for
sequences which contain either more or fewer amino acids than the protein sequences shown in
Figure 1, it is understood that the percentage of homology will be determined based on the number of
homologous amino acids in relation to the total number of amino acids. Thus, for example, homology
of sequences shorter than that shown in Figure 1, as discussed below, will be determined using the
number of amino acids in the shorter sequence.

GFP proteins of the present invention may be shorter or longer than the amino acid sequences shown
in Figure 1. Thus, in a preferred embodiment, included within the definition of GFP proteins are
portions or fragments of the sequences depicted herein. Portions or fragments of r- and pGFP
proteins are considered GFP proteins if a) they share at least one antigenic epitope; or b) have at least
the indicated homology; c) preferably have GFP biological activity, e.g., including, but not limited to,
autofluorescence; or d) fold into a stable structure that is similar to the wild-type structure.

For example, r- and pGFP deletion mutants can be made. At the N-terminus, it is known that only the
first amino acid of the aGFP protein may be deleted without loss of fluorescence. At the C-terminus of
the aGFP, up to 7 residues can be deleted without loss of fluorescence; see Phillips et al., Current
Opin. Structural Biol. 7:821 (1997). This presumably applies to rGFP as well.

In one embodiment, the r- and pGFP proteins are derivative or variant GFP proteins. That is, as
outlined more fully below, the derivative GFP will contain at least one amino acid substitution, deletion
or insertion, with amino acid substitutions being particularly preferred. The amino acid substitution,
insertion or deletion may occur at any residue within the GFP protein. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the GFP protein, using
cassette or PCR mutagenesis or other techniques well known in the art, to produce DNA encoding the
variant, and thereafter expressing the DNA in recombinant cell culture as is known in the art and
outlined herein. However, variant GFP protein fragments having up to about 100-150 residues may be
prepared by in vitro synthesis using established techniques. Amino acid sequence variants are
characterized by the predetermined nature of the variation, a feature that sets them apart from
naturally occurring allelic or interspecies variation of the GFP protein amino acid sequence. The
variants typically exhibit the same qualitative biological activity as the naturally occurring analogue,
although variants can also be selected which have modified characteristics as will be more fully
outlined below. That is, in a preferred embodiment, when non-wild-type GFP is used, the derivative
preferably has at least 1% of wild-type fluorescence, with at least about 10% being preferred, at least
about 50-60% being particularly preferred and 95% to 98% to 100% being especially preferred. In
general, what is important is that there is enough fluorescence to allow sorting and/or detection above
background, for example using a fluorescence-activated cell sorter (FACS) machine. However, in
some embodiments, for example when fusion proteins with GFP are made, it is possible to detect the
fusion proteins non-fluorescently, using, for example, antibodies directed to either an epitope tag (i.e.
purification sequence) or to the GFP itself. In this case the GFP scaffold does not have to be
fluorescent, if it can be shown that the GFP is folding correctly and/or reproducibly.

Thus, the rGFP or pGFP may be wild type or variants thereof. These variants fall into one or more of
three classes: substitutional, insertional or deletional variants. These variants ordinarily are prepared
by site specific mutagenesis of nucleotides in the DNA encoding the GFP, using cassette or PCR
mutagenesis or other techniques well known in the art, to produce DNA encoding the variant, and
thereafter expressing the DNA in recombinant cell culture as outlined herein. However, variant protein
fragments having up to about 100-150 residues may be prepared by in vitro synthesis using
established techniques. Amino acid sequence variants are characterized by the predetermined nature
of the variation, a feature that sets them apart from naturally occurring allelic or interspecies variation
of the rGFP amino acid sequence. The variants typically exhibit the same qualitative biological activity
as the naturally occurring analogue, although variants can also be selected which have modified
characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is predetermined, the
mutation per se need not be predetermined. For example, in order to optimize the performance of a
mutation at a given site, random mutagenesis may be conducted at the target codon or region and the
expressed scaffold variants screened for the optimal combination of desired activity. Techniques for
making substitution mutations at predetermined sites in DNA having a known sequence are well
known, for example, M13 primer mutagenesis and PCR mutagenesis. Screening of the mutants is
done using assays of scaffold protein activities.

Amino acid substitutions are typically of single residues; insertions usually will be on the order of from
about 1 to 20 amino acids, although considerably larger insertions may be tolerated. Deletions range
from about 1 to about 20 residues, although in some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final
derivative. Generally these changes are done on a few amino acids to minimize the alteration of the
molecule. However, larger changes may be tolerated in certain circumstances. When small
alterations in the characteristics of the p- or rGFP protein are desired, substitutions are generally
made in accordance with the following chart:



Chart I

Original Residue        Exemplary Substitutions
Ala                     Ser
Arg                     Lys
Asn                     Gln, His
Asp                     Glu
Cys                     Ser
Gln                     Asn
Glu                     Asp
Gly                     Pro
His                     Asn, Gln
Ile                     Leu, Val
Leu                     Ile, Val
Lys                     Arg, Gln, Glu
Met                     Leu, Ile
Phe                     Met, Leu, Tyr
Ser                     Thr
Thr                     Ser
Trp                     Tyr
Tyr                     Trp, Phe
Val                     Ile, Leu



Substantial changes in function or immunological identity are made by selecting substitutions that are
less conservative than those shown in Chart I. For example, substitutions may be made which more
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for example
the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the molecule at the target
site; or the bulk of the side chain. The substitutions which in general are expected to produce the
greatest changes in the polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl
or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an
electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative
residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is
substituted for (or by) one not having a side chain, e.g. glycine.
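
As an illustration of how Chart I might be applied when planning variants, the following sketch encodes the chart as a lookup table and reports whether a proposed substitution is one of the exemplary (conservative) substitutions. The table is transcribed from Chart I as read here, pairing the residues and substitutions in the order listed; the names CHART_I and is_chart_i_substitution are hypothetical and chosen for illustration only.

```python
# Chart I as a lookup: original residue -> exemplary substitutions.
# Three-letter codes are used, as in the chart. Proline has no row
# in the chart and is therefore absent from the table.
CHART_I = {
    "Ala": {"Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_chart_i_substitution(original: str, replacement: str) -> bool:
    """Return True if `replacement` is listed for `original` in Chart I.

    The chart is directional: Ala -> Ser is listed, Ser -> Ala is not.
    Substitutions not found here are the "less conservative" ones.
    """
    return replacement in CHART_I.get(original, set())
```

A substitution for which this check returns False is not necessarily excluded; it simply falls into the less conservative category, where larger changes in properties would be expected.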

As outlined above, the variants typically exhibit the same qualitative biological activity (i.e.
fluorescence), although variants also are selected to modify the characteristics of the GFP protein as
needed.



In a preferred embodiment, specific residues of rGFP and/or pGFP are substituted, resulting in
proteins with modified characteristics. Such substitutions may occur at one or more residues, with
1-10 substitutions being preferred. Preferred characteristics to be modified include range of spectral
emission, including shifts in peak emission, rate of folding, stability, expression levels, toxicity, and
emission intensity. As is known in the art, there are a number of aGFP variants with desirable
properties, and these may be varied in rGFP and pGFP as well.

In a preferred embodiment, residue 46 of rGFP and pGFP (corresponding to residue 43 of aGFP) is
substituted with a Thr or an Ala.

In a preferred embodiment, residue 69 of rGFP and pGFP (corresponding to residue 65 of aGFP) is
substituted with a Thr, Ile, Cys, Ser, Leu, Ala or Gly.

In a preferred embodiment, residue 70 of rGFP and pGFP (corresponding to residue 66 of aGFP) is
substituted with a His, Phe, or Trp.

In a preferred embodiment, residue 72 of rGFP and pGFP (corresponding to residue 68 of aGFP) is
substituted with a Val or Leu.

In a preferred embodiment, residue 76 of rGFP and pGFP (corresponding to residue 72 of aGFP) is
substituted with a Ser or Ala.

In a preferred embodiment, residue 101 of rGFP and pGFP (corresponding to residue 99 of aGFP) is
substituted with a Phe or Ser.

In a preferred embodiment, residue 125 of rGFP and pGFP (corresponding to residue 123 of aGFP) is
substituted with an Ile.

In a preferred embodiment, residue 147 of rGFP and pGFP (corresponding to residue 145 of aGFP) is
substituted with a Tyr, Phe or His.

In a preferred embodiment, residue 148 of rGFP and pGFP (corresponding to residue 146 of aGFP) is
substituted with an N or I.

In a preferred embodiment, residue 150 of rGFP and pGFP (corresponding to residue 148 of aGFP) is
substituted with a His or Arg.

In a preferred embodiment, residue 155 of rGFP and pGFP (corresponding to residue 153 of aGFP) is
substituted with a Thr or Ala.

In a preferred embodiment, residue 162 of rGFP and pGFP (corresponding to residue 163 of aGFP) is
substituted with a Val or Ala.

In a preferred embodiment, residue 166 of rGFP and pGFP (corresponding to residue 167 of aGFP) is
substituted with an Ile or Thr.

In a preferred embodiment, residue 200 of rGFP and pGFP (corresponding to residue 202 of aGFP) is
substituted with a Ser or Phe.

In a preferred embodiment, residue 201 of rGFP and pGFP (corresponding to residue 203 of aGFP) is
substituted with an Ile or Thr.

In a preferred embodiment, residue 203 of rGFP and pGFP (corresponding to residue 205 of aGFP) is
substituted with a Ser or Thr.

In a preferred embodiment, residue 210 of rGFP and pGFP (corresponding to residue 212 of aGFP) is
substituted with an N or Val.
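
The residue correspondences and preferred substitutions listed above lend themselves to a simple lookup table. The sketch below is one possible encoding, keyed by the rGFP/pGFP residue number; the names PREFERRED_SUBSTITUTIONS, agfp_position and is_preferred are hypothetical. Two readings are assumptions rather than certainties: the single-letter "N" and "I" are taken to mean Asn and Ile, and the entry for residue 150 assumes that the residue corresponding to aGFP 148 is rGFP/pGFP 150.

```python
# rGFP/pGFP residue -> (corresponding aGFP residue, preferred substitutions)
PREFERRED_SUBSTITUTIONS = {
    46: (43, {"Thr", "Ala"}),
    69: (65, {"Thr", "Ile", "Cys", "Ser", "Leu", "Ala", "Gly"}),
    70: (66, {"His", "Phe", "Trp"}),
    72: (68, {"Val", "Leu"}),
    76: (72, {"Ser", "Ala"}),
    101: (99, {"Phe", "Ser"}),
    125: (123, {"Ile"}),
    147: (145, {"Tyr", "Phe", "His"}),
    148: (146, {"Asn", "Ile"}),   # assumption: "N or I" read as Asn or Ile
    150: (148, {"His", "Arg"}),   # assumption: residue number read as 150
    155: (153, {"Thr", "Ala"}),
    162: (163, {"Val", "Ala"}),
    166: (167, {"Ile", "Thr"}),
    200: (202, {"Ser", "Phe"}),
    201: (203, {"Ile", "Thr"}),
    203: (205, {"Ser", "Thr"}),
    210: (212, {"Asn", "Val"}),   # assumption: "N" read as Asn
}


def agfp_position(rgfp_position: int) -> int:
    """Return the aGFP residue number corresponding to an rGFP/pGFP residue."""
    return PREFERRED_SUBSTITUTIONS[rgfp_position][0]


def is_preferred(rgfp_position: int, residue: str) -> bool:
    """Return True if `residue` is a listed substitution at that position."""
    entry = PREFERRED_SUBSTITUTIONS.get(rgfp_position)
    return entry is not None and residue in entry[1]
```

Note that the offset between the two numbering schemes is not constant along the sequence, which is why an explicit table is used here instead of a fixed arithmetic shift.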

In addition, rGFP and pGFP proteins can be made that are longer than the wild-type, for example, by
the addition of epitope or purification tags, the addition of other fusion sequences, etc., as is more fully
outlined below.



In a preferred embodiment, the p- or rGFP protein is fused to a protein of interest. This may be done,
for example, to allow tracking or localization of the protein of interest to a particular subcellular
location, or to allow for quantification of expression, etc.

In a preferred embodiment, the r- or pGFP is fused to a random peptide to form a fusion polypeptide.
By "fused" or "operably linked" herein is meant that the random peptide, as defined below, and the
GFP protein are linked together, in such a manner as to minimize the disruption to the stability of the
GFP structure (i.e. it can retain biological activity). That is, the GFP preferably retains its ability to
fluoresce, or maintains a Tm of at least 42°C. As outlined below, the fusion polypeptide (or fusion
polynucleotide encoding the fusion polypeptide) can comprise further components as well, including
multiple peptides at multiple loops, fusion partners, etc.

The fusion polypeptide preferably includes additional components, including, but not limited to, fusion
partners and linkers.

In a preferred embodiment, the random peptide is fused to the N-terminus of the p- or rGFP. The
fusion can be direct, i.e. with no additional residues between the C-terminus of the peptide and the
N-terminus of the p- or rGFP, or indirect; that is, intervening amino acids are used, such as one or more
fusion partners, including a linker. In this embodiment, preferably a presentation structure is used, to
confer some conformational stability to the peptide. Particularly preferred embodiments include the
use of dimerization sequences.

In one embodiment, N-terminal residues of the p- or rGFP are deleted, i.e. one or more amino acids of
the p- or rGFP can be deleted and replaced with the peptide. However, as noted above, deletions of
more than 7 amino acids may render the p- or rGFP less fluorescent, and thus larger deletions are
generally not preferred. In a preferred embodiment, the fusion is directly to the first amino acid of the
p- or rGFP.

In a preferred embodiment, the random peptide is fused to the C-terminus of the p- or rGFP. As
above for N-terminal fusions, the fusion can be direct or indirect, and C-terminal residues may be
deleted.



In a preferred embodiment, peptides and fusion partners are added to both the N- and the C-terminus
of the p- or rGFP. As the N- and C-terminus of p- or rGFP are putatively on the same "face" of the
protein, as is the case for aGFP, in spatial proximity (within 18 Å), it is possible to make a
non-covalently "circular" p- or rGFP protein using the components of the invention. Thus for example, the
use of dimerization sequences can allow a noncovalently cyclized protein; by attaching a first
dimerization sequence to either the N- or C-terminus of p- or rGFP, and adding a random peptide and
a second dimerization sequence to the other terminus, a large compact structure can be formed.

In a preferred embodiment, the random peptide is fused to an internal position of the rGFP or pGFP;
that is, the peptide is inserted at an internal position of the p- or rGFP. While the peptide can be
inserted at virtually any position, preferred positions include insertion at the very tips of "loops" on the
surface of the p- or rGFP, to minimize disruption of the p- or rGFP beta-can protein structure.

In a preferred embodiment, the random peptide is inserted in rGFP and/or pGFP loops. That is,
libraries of random peptides (or, alternatively, single peptides) can be inserted into or replace external
loops. In a preferred embodiment, the loop comprises rGFP or pGFP residues from about 103 to
about 106. As outlined below, this can be either an insertion (e.g. without replacing any residues), or
the addition of the random peptides or other fusion partners results in the replacement of one or more
of the native residues. Similar preferred embodiments utilize replacements or insertions at positions
from about 117 to about 120 of both rGFP and pGFP; replacements or insertions at positions from
about 157 to about 158; replacements or insertions at positions from about 170 to about 173;
replacements or insertions at positions from about 186 to about 191; or replacements or insertions at
positions from about 208 to about 213. More preferably the insertion or replacement will take place
between residues 117-120, 170-173 or 208-213. Most preferably the insertion will take place between
residues 170-173 or 208-213.

In a preferred embodiment, the random peptide is inserted, without any deletion of p- or rGFP 
residues. That is, the insertion point is between two amino acids in the loop, adding the new amino
acids of the peptide and fusion partners, including linkers. Generally, when linkers are used, the
linkers are directly fused to the p- or rGFP, with additional fusion partners, if present, being fused to
the linkers and the peptides.

In a preferred embodiment, the peptide is inserted into the p- or rGFP, with one or more p- or rGFP 
residues being deleted; that is, the random peptide (and fusion partners, including linkers) replaces 
one or more residues. In general, when linkers are used, the linkers are attached directly to the p- or
rGFP, thus it is linker residues which replace the p- or rGFP residues, again generally at the tip of the
loop. In general, when residues are replaced, from one to five residues of p- or rGFP are deleted, with
deletions of one, two, three, four and five amino acids all possible.
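
The two loop-modification modes described above (pure insertion, and insertion with replacement of one to five scaffold residues) can be sketched as a single string operation. This is a minimal illustration only: the function name insert_peptide, its parameters, and the placement of an identical linker on each side of the peptide are assumptions, and the toy scaffold in the usage note is not a GFP sequence.

```python
def insert_peptide(scaffold: str, peptide: str, after_residue: int,
                   n_deleted: int = 0, linker: str = "") -> str:
    """Insert `peptide` into `scaffold` after a 1-based residue number.

    With n_deleted == 0 the peptide is inserted between two residues.
    With n_deleted in 1..5 the next n_deleted scaffold residues are
    removed, so the peptide (plus linkers) replaces them.
    Linkers, if given, sit directly against the scaffold on both sides.
    """
    if not 0 <= n_deleted <= 5:
        raise ValueError("from zero to five scaffold residues may be deleted")
    if after_residue < 1 or after_residue + n_deleted >= len(scaffold):
        raise ValueError("insertion point must be internal to the scaffold")

    head = scaffold[:after_residue]              # residues 1..after_residue
    tail = scaffold[after_residue + n_deleted:]  # residues after the deleted run
    return head + linker + peptide + linker + tail
```

For example, with a ten-residue toy scaffold "ABCDEFGHIJ", inserting "xyz" after residue 4 gives "ABCDxyzEFGHIJ", while the same call with n_deleted=2 gives "ABCDxyzGHIJ", the peptide having replaced residues 5 and 6.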

In a preferred embodiment, peptides (including fusion partners, if applicable) can be inserted into 
more than one loop of the scaffold at a time. Thus, for example, adding peptides to two loops can 
increase the complexity of the library but still allow presentation of these loops on the same face of the 
protein. Similarly, it is possible to add peptides to one or more loops and add other fusion partners to 
other loops, such as targeting sequences, etc. 

Thus, fusion polypeptides comprising p- or rGFP and random peptides are provided. Similarly, the 
invention provides fusion nucleic acids encoding the fusion polypeptides. In addition, to facilitate the
introduction of random peptides into the p- or rGFP, a preferred embodiment provides p- or rGFP
nucleic acids with a multisite cloning site inserted into at least one loop outlined above.

In a preferred embodiment, the fusion polypeptides further comprise fusion partners. By "fusion
partner" herein is meant a sequence that is associated with the random peptide that confers upon all
members of the library in that class a common function or ability. Fusion partners can be
heterologous (i.e. not native to the host cell), or synthetic (not native to any cell). Suitable fusion
partners include, but are not limited to: a) presentation structures, as defined below, which provide the
peptides in a conformationally restricted or stable form; b) targeting sequences, defined below, which
allow the localization of the peptide into a subcellular or extracellular compartment; c) rescue
sequences as defined below, which allow the purification or isolation of either the peptides or the
nucleic acids encoding them; d) stability sequences, which confer stability or protection from
degradation to the peptide or the nucleic acid encoding it, for example resistance to proteolytic
degradation; e) linker sequences, which conformationally decouple the random peptide elements from
the scaffold itself, which keep the peptide from interfering with scaffold folding; or f) any combination
of a), b), c), d) and e) as well as linker sequences as needed.

In a preferred embodiment, the fusion partner is a presentation structure. By "presentation structure"
or grammatical equivalents herein is meant a sequence, which, when fused to peptides, causes the
peptides to assume a conformationally restricted form. Proteins interact with each other largely
through conformationally constrained domains. Although small peptides with freely rotating amino
and carboxyl termini can have potent functions as is known in the art, the conversion of such peptide
structures into pharmacologic agents is difficult due to the inability to predict side-chain positions for
peptidomimetic synthesis. Therefore the presentation of peptides in conformationally constrained
structures will benefit both the later generation of pharmacophore models and pharmaceuticals and
will also likely lead to higher affinity interactions of the peptide with the target protein. This fact has
been recognized in the combinatorial library generation systems using biologically generated short
peptides in bacterial phage systems. A number of workers have constructed small domain molecules
in which one might present randomized peptide structures.

Thus, synthetic presentation structures, i.e. artificial polypeptides, are capable of presenting a
randomized peptide as a conformationally-restricted domain. Generally such presentation structures
comprise a first portion joined to the N-terminal end of the randomized peptide, and a second portion
joined to the C-terminal end of the peptide; that is, the peptide is inserted into the presentation
structure, although variations may be made, as outlined below, in which elements of the presentation
structure are included within the random peptide sequence. To increase the functional isolation of the
randomized expression product, the presentation structures are selected or designed to have minimal
biological activity when expressed in the target cell.

Preferred presentation structures maximize accessibility to the peptide by presenting it on an exterior
surface such as a loop, and also cause further conformational constraints in a peptide. Accordingly,
suitable presentation structures include, but are not limited to, dimerization sequences, minibody
structures, loops on beta-turns and coiled-coil stem structures in which residues not critical to structure
are randomized, zinc-finger domains, cysteine-linked (disulfide) structures, transglutaminase linked
structures, cyclic peptides, B-loop structures, helical barrels or bundles, leucine zipper motifs, etc.

In a preferred embodiment, the presentation structure is a coiled-coil structure, allowing the
presentation of the randomized peptide on an exterior loop (see, for example, Myszka et al.,
Biochem. 33:2362-2373 (1994), hereby incorporated by reference). Using this system, investigators
have isolated peptides capable of high affinity interaction with the appropriate target. In general,
coiled-coil structures allow for between 6 and 20 randomized positions.

A preferred coiled-coil presentation structure is as follows:

MGC AALESEVSALESEVASLESEVAAL GRGDMP LAAVKSKLSAVKSKLASVKSKLAA CGPP

The underlined regions represent a coiled-coil leucine zipper region defined previously (see Martin et al.,
EMBO J. 13(22):5303-5309 (1994), incorporated by reference). The bolded GRGDMP region
represents the loop structure and, when appropriately replaced with randomized peptides (i.e.
peptides generally depicted herein as (X)n, where X is an amino acid residue and n is an integer of at
least 5 or 6), can be of variable length. The replacement of the bolded region is facilitated by encoding
restriction endonuclease sites in the underlined regions, which allows the direct incorporation of
randomized oligonucleotides at these positions. For example, a preferred embodiment generates a
XhoI site at the double-underlined LE site and a HindIII site at the double-underlined KL site.
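
The pairing of restriction sites with dipeptides described above can be checked with a short sketch. It assumes the standard genetic code and assumes each recognition sequence is read in frame from its first base; the function names are illustrative and are not part of the disclosure.

```python
# Minimal codon table covering only the codons used below (standard genetic code).
CODONS = {"CTC": "L", "GAG": "E", "AAG": "K", "CTT": "L"}

# Recognition sequences of the two enzymes named in the text.
SITES = {"XhoI": "CTCGAG", "HindIII": "AAGCTT"}


def translate(dna):
    """Translate an in-frame DNA string using the small table above."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


def site_dipeptides():
    """Map each enzyme to the dipeptide its recognition site encodes (frame assumed)."""
    return {enzyme: translate(seq) for enzyme, seq in SITES.items()}


print(site_dipeptides())  # {'XhoI': 'LE', 'HindIII': 'KL'}
```

Under these assumptions the XhoI site reads as Leu-Glu and the HindIII site as Lys-Leu, which is consistent with placing the sites at the LE and KL positions without altering the encoded zipper sequence.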

In a preferred embodiment, the presentation structure is a minibody structure. A "minibody" is
essentially composed of a minimal antibody complementarity region. The minibody presentation
structure generally provides two randomizing regions that in the folded protein are presented along a
single face of the tertiary structure (see, for example, Bianchi et al., J. Mol. Biol. 236(2):649-59 (1994),
and references cited therein, all of which are incorporated by reference). Investigators have shown
this minimal domain is stable in solution and have used phage selection systems in combinatorial
libraries to select minibodies with peptide regions exhibiting high affinity, Kd = 10^-7, for the
pro-inflammatory cytokine IL-6.

A preferred minibody presentation structure is as follows:

MGRNSQATSGFTFSHFYMEWVRGGEYIAAS RHKHNKY TTEYSASVKGRYIVSRDTSQSILYLQKKKGPP

The bold, underlined regions are the regions which may be randomized. The italicized phenylalanine
must be invariant in the first randomizing region. The entire peptide is cloned in a three-oligonucleotide
variation of the coiled-coil embodiment, thus allowing two different randomizing regions to be
incorporated simultaneously. This embodiment utilizes non-palindromic BstXI sites on the termini.

In a preferred embodiment, the presentation structure is a sequence that contains generally two
cysteine residues, such that a disulfide bond may be formed, resulting in a conformationally
constrained sequence. This embodiment is particularly preferred ex vivo, for example when secretory
targeting sequences are used. As will be appreciated by those in the art, any number of random
sequences, with or without spacer or linking sequences, may be flanked with cysteine residues. In
other embodiments, effective presentation structures may be generated by the random regions
themselves. For example, the random regions may be "doped" with cysteine residues which, under
the appropriate redox conditions, may result in highly crosslinked structured conformations, similar to
a presentation structure. Similarly, the randomization regions may be controlled to contain a certain
number of residues to confer β-sheet or α-helical structures.

In a preferred embodiment, the presentation sequence confers the ability to bind metal ions to confer
secondary structure. Thus, for example, C2H2 zinc finger sequences are used; C2H2 sequences
have two cysteines and two histidines placed such that a zinc ion is chelated. Zinc finger domains are
known to occur independently in multiple zinc-finger peptides to form structurally independent, flexibly
linked domains. See J. Mol. Biol. 228:619 (1992). A general consensus sequence is (5 amino acids)-
C-(2 to 3 amino acids)-C-(4 to 12 amino acids)-H-(3 amino acids)-H-(5 amino acids). A preferred
example would be -FQCEEC-random peptide of 3 to 20 amino acids-HIRSHTG-.
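
As an illustration only, the core of the C2H2 consensus quoted above can be written as a regular expression and applied to a construct built from the FQCEEC and HIRSHTG flanks. This is a sketch under stated assumptions: the five-residue outer flanks of the consensus are left out, the insert sequence is hypothetical, and the helper names are not from the disclosure.

```python
import re

# Core of the consensus quoted in the text: C-(2 to 3 aa)-C-(4 to 12 aa)-H-(3 aa)-H.
# The 5-residue outer flanks are omitted; "." stands for any amino acid.
C2H2_CORE = re.compile(r"C.{2,3}C.{4,12}H.{3}H")


def has_c2h2_core(peptide):
    """Return True if the peptide contains the C2H2 core spacing pattern."""
    return C2H2_CORE.search(peptide) is not None


def build_finger(insert):
    """Place a peptide insert between the FQCEEC and HIRSHTG flanks from the text."""
    return "FQCEEC" + insert + "HIRSHTG"


# Hypothetical 8-residue insert, chosen only for illustration.
example = build_finger("ASTGRDLW")
print(example, has_c2h2_core(example))
```

Note that the preferred example allows inserts of 3 to 20 residues, a wider range than the 4 to 12 residues of the general consensus, so a pattern check of this kind accepts only a subset of the constructs the text contemplates.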

Similarly, CCHC boxes can be used that have a consensus sequence -C-(2 amino acids)-C-(4 to 20
random peptide)-H-(4 amino acids)-C- (see Bavoso et al., Biochem. Biophys. Res. Commun.
242(2):385 (1998), hereby incorporated by reference). Preferred examples include (1) -VKCFNC-4 to
20 random amino acids-HTARNCR-, based on the nucleocapsid protein P2; (2) a sequence modified
from that of the naturally occurring zinc-binding peptide of the Lasp-1 LIM domain (Hammarstrom et
al., Biochem. 35:12723 (1996)); and (3) -MNPNCARCG-4 to 20 random amino acids-HKACF-, based
on the NMR structural ensemble 1ZFP (Hammarstrom et al., Biochem. 35(39):12723 (1996)).

In a preferred embodiment, the presentation structure is a dimerization sequence, including self-
binding peptides. A dimerization sequence allows the non-covalent association of two peptide
sequences, which can be the same or different, with sufficient affinity to remain associated under
normal physiological conditions. These sequences may be used in several ways. In a preferred
embodiment, one terminus of the random peptide is joined to a first dimerization sequence and the
other terminus is joined to a second dimerization sequence, which can be the same or different from
the first sequence. This allows the formation of a loop upon association of the dimerizing sequences.
Alternatively, the use of these sequences effectively allows small libraries of random peptides (for
example, 10^4) to become large libraries if two peptides per cell are generated which then dimerize, to
form an effective library of 10^8 (10^4 x 10^4). It also allows the formation of longer random peptides, if
needed, or more structurally complex random peptide molecules. The dimers may be homo- or
heterodimers.
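
The library-size argument is simple multiplication, sketched below. The figure of 10^4 members per library is taken as an illustrative starting size, and the function name is hypothetical.

```python
def effective_library_size(n_first, n_second=None):
    """Count ordered pairs formed when one peptide from each library dimerizes.

    If only one size is given, both peptides are assumed to be drawn from
    libraries of the same size.
    """
    if n_second is None:
        n_second = n_first
    return n_first * n_second


print(effective_library_size(10**4))  # 100000000
```

This counts ordered pairs, which fits the case of two distinct dimerization sequences. If both peptides carried the same self-associating sequence, so that a pair and its mirror image were indistinguishable, the number of distinct dimers would be closer to n(n+1)/2.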

Dimerization sequences may be a single sequence that self-aggregates, or two different sequences
that associate. That is, nucleic acids encode both a first random peptide with dimerization sequence
1 and a second random peptide with dimerization sequence 2, such that upon introduction into a cell
and expression of the nucleic acid, dimerization sequence 1 associates with dimerization sequence 2
to form a new random peptide structure. The use of dimerization sequences allows the
"circularization" of the random peptides; that is, if a dimerization sequence is used at each terminus of
the peptide, the resulting structure can form a "stem-loop" type of structure. Furthermore, the use of
dimerizing sequences fused to both the N- and C-terminus of a scaffold such as GFP forms a
noncovalently cyclized scaffold random peptide library.

Suitable dimerization sequences will encompass a wide variety of sequences. Any number of protein-
protein interaction sites are known. In addition, dimerization sequences may also be elucidated using
standard methods such as the yeast two hybrid system, traditional biochemical affinity binding studies,
or even using the present methods. See U.S.S.N. 60/080,444, filed April 2, 1998, hereby incorporated
by reference in its entirety. Particularly preferred dimerization peptide sequences include, but are not
limited to, -EFLIVKS-, -EEFLIVKKS-, -FESIKLV-, and -VSIKFEL-.

In a preferred embodiment, the fusion partner is a targeting sequence. As will be appreciated by
those in the art, the localization of proteins within a cell is a simple method for increasing effective
concentration and determining function. For example, RAF1 when localized to the mitochondrial
membrane can inhibit the anti-apoptotic effect of BCL-2. Similarly, membrane bound Sos induces Ras
mediated signaling in T-lymphocytes. These mechanisms are thought to rely on the principle of limiting
the search space for ligands; that is to say, the localization of a protein to the plasma membrane limits
the search for its ligand to that limited dimensional space near the membrane as opposed to the three
dimensional space of the cytoplasm. Alternatively, the concentration of a protein can also be simply
increased by nature of the localization. Shuttling the proteins into the nucleus confines them to a
smaller space, thereby increasing concentration. Finally, the ligand or target may simply be localized
to a specific compartment, and inhibitors must be localized appropriately.

Thus, suitable targeting sequences include, but are not limited to, binding sequences capable of
causing binding of the expression product to a predetermined molecule or class of molecules while
retaining bioactivity of the expression product (for example by using enzyme inhibitor or substrate
sequences to target a class of relevant enzymes); sequences signalling selective degradation of itself
or co-bound proteins; and signal sequences capable of constitutively localizing the peptides to a
predetermined cellular locale, including a) subcellular locations such as the Golgi, endoplasmic
reticulum, nucleus, nucleoli, nuclear membrane, mitochondria, chloroplast, secretory vesicles,
lysosome, and cellular membrane; and b) extracellular locations via a secretory signal. Particularly
preferred is localization to either subcellular locations or to the outside of the cell via secretion.

In a preferred embodiment, the targeting sequence is a nuclear localization signal (NLS). NLSs are
generally short, positively charged (basic) domains that serve to direct the entire protein in which they
occur to the cell's nucleus. Numerous NLS amino acid sequences have been reported, including
single basic NLSs such as that of the SV40 (monkey virus) large T antigen (Pro Lys Lys Lys Arg Lys
Val; Kalderon et al., Cell 39:499-509 (1984)); the human retinoic acid receptor-β nuclear localization
signal (ARRRRP); NFκB p50 (EEVQRKRQKL; Ghosh et al., Cell 62:1019 (1990)); NFκB p65
(EEKRKRTYE; Nolan et al., Cell 64:961 (1991)); and others (see, for example, Boulikas, J. Cell.
Biochem. 55(1):32-58 (1994), hereby incorporated by reference), and double basic NLSs exemplified
by that of the Xenopus (African clawed toad) protein nucleoplasmin (Ala Val Lys Arg Pro Ala Ala Thr
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Leu Asp; Dingwall et al., Cell 30:449-458 (1982) and
Dingwall et al., J. Cell Biol. 107:841-849 (1988)). Numerous localization studies have demonstrated
that NLSs incorporated in synthetic peptides or grafted onto reporter proteins not normally targeted to
the cell nucleus cause these peptides and reporter proteins to be concentrated in the nucleus. See,
for example, Dingwall and Laskey, Ann. Rev. Cell Biol. 2:367-390 (1986); Bonnerot et al., Proc. Natl.
Acad. Sci. USA 84:6795-6799 (1987); Galileo et al., Proc. Natl. Acad. Sci. USA 87:458-462 (1990).
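
The description of NLSs as short, basic domains can be made concrete by computing the share of lysine and arginine residues in the sequences quoted above. This is a rough sketch: counting only K and R as basic is a simplifying assumption, and the dictionary labels and function name are illustrative.

```python
# Lysine and arginine carry the positive charge at neutral pH; histidine is
# ignored here as a simplifying assumption.
BASIC = set("KR")

# One-letter forms of sequences quoted in the text.
NLS_EXAMPLES = {
    "SV40 large T antigen": "PKKKRKV",
    "retinoic acid receptor": "ARRRRP",
    "NFkB p50": "EEVQRKRQKL",
    "NFkB p65": "EEKRKRTYE",
}


def basic_fraction(seq):
    """Fraction of residues in seq that are lysine or arginine."""
    return sum(res in BASIC for res in seq) / len(seq)


for name, seq in NLS_EXAMPLES.items():
    print(f"{name}: {len(seq)} residues, basic fraction {basic_fraction(seq):.2f}")
```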

In a preferred embodiment, the targeting sequence is a membrane anchoring signal sequence. This is
particularly useful since many parasites and pathogens bind to the membrane, in addition to the fact
that many intracellular events originate at the plasma membrane. Thus, membrane-bound peptide
libraries are useful for both the identification of important elements in these processes as well as for
the discovery of effective inhibitors. The invention provides methods for presenting the randomized
expression product extracellularly or in the cytoplasmic space. For extracellular presentation, a
membrane anchoring region is provided at the carboxyl terminus of the peptide presentation structure.
The randomized expression product region is expressed on the cell surface and presented to the
extracellular space, such that it can bind to other surface molecules (affecting their function) or
molecules present in the extracellular medium. The binding of such molecules could confer function
on the cells expressing a peptide that binds the molecule. The cytoplasmic region could be neutral or
could contain a domain that, when the extracellular randomized expression product region is bound,
confers a function on the cells (activation of a kinase, phosphatase, binding of other cellular
components to effect function). Similarly, the randomized expression product-containing region could
be contained within a cytoplasmic region, and the transmembrane region and extracellular region
remain constant or have a defined function.

Membrane-anchoring sequences are well known in the art and are based on the genetic geometry of
mammalian transmembrane molecules. Peptides are inserted into the membrane based on a signal
sequence (designated herein as ssTM) and require a hydrophobic transmembrane domain (herein
TM). The transmembrane proteins are inserted into the membrane such that the regions encoded 5'
of the transmembrane domain are extracellular and the sequences 3' become intracellular. Of course,
if these transmembrane domains are placed 5' of the variable region, they will serve to anchor it as an
intracellular domain, which may be desirable in some embodiments. ssTMs and TMs are known for a
wide variety of membrane bound proteins, and these sequences may be used accordingly, either as
pairs from a particular protein or with each component being taken from a different protein, or
alternatively, the sequences may be synthetic, and derived entirely from consensus as artificial
delivery domains.

As will be appreciated by those in the art, membrane-anchoring sequences, including both ssTM and
TM, are known for a wide variety of proteins and any of these may be used. Particularly preferred
membrane-anchoring sequences include, but are not limited to, those derived from CD8, ICAM-2,
IL-8R, CD4 and LFA-1.



Useful sequences include sequences from: 1) class I integral membrane proteins such as IL-2
receptor beta-chain (residues 1-26 are the signal sequence, 241-265 are the transmembrane
residues; see Hatakeyama et al., Science 244:551 (1989) and von Heijne et al., Eur. J. Biochem.
174:671 (1988)) and insulin receptor β-chain (residues 1-27 are the signal, 957-959 are the
transmembrane domain and 960-1382 are the cytoplasmic domain; see Hatakeyama, supra, and
Ebina et al., Cell 40:747 (1985)); 2) class II integral membrane proteins such as neutral endopeptidase
(residues 29-51 are the transmembrane domain, 2-28 are the cytoplasmic domain; see Malfroy et al.,
Biochem. Biophys. Res. Commun. 144:59 (1987)); 3) type III proteins such as human cytochrome
P450 NF25 (Hatakeyama, supra); and 4) type IV proteins such as human P-glycoprotein
(Hatakeyama, supra). Particularly preferred are CD8 and ICAM-2. For example, the signal
sequences from CD8 and ICAM-2 lie at the extreme 5' end of the transcript. These consist of the
amino acids 1-32 in the case of CD8 (MASPLTRFLSLNLLLLGESILGSGEAKPQAP; Nakauchi et al.,
PNAS USA 82:5126 (1985)) and 1-21 in the case of ICAM-2 (MSSFGYRTLTVALFTLICCPG; Staunton
et al., Nature (London) 339:61 (1989)). These leader sequences deliver the construct to the
membrane while the hydrophobic transmembrane domains, placed 3' of the random peptide region,
serve to anchor the construct in the membrane. These transmembrane domains are encompassed
by amino acids 145-195 from CD8
(PQRPEDCRPRGSVKGTGLDFACDIYIWAPLAGICVALLLSLKTLICYHSR; Nakauchi, supra) and
224-256 from ICAM-2 (MVIIVTVVSVLLCLFVTSVLLCFIFGQHLRQQR; Staunton, supra).

Alternatively, membrane anchoring sequences include the GPI anchor, which results in a covalent
bond between the molecule and the lipid bilayer via a glycosyl-phosphatidylinositol bond, for example in
DAF (PNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT, with the bolded serine the site of the
anchor; see Homans et al., Nature 333(6170):269-72 (1988), and Moran et al., J. Biol. Chem.
265:1250 (1991)). In order to do this, the GPI sequence from Thy-1 can be cassetted 3' of the
variable region in place of a transmembrane sequence.

Similarly, myristylation sequences can serve as membrane anchoring sequences. It is known that the
myristylation of c-src recruits it to the plasma membrane. This is a simple and effective method of
membrane localization, given that the first 14 amino acids of the protein are solely responsible for this
function: MGSSKSKPKDPSQR (see Cross et al., Mol. Cell. Biol. 4(9):1834 (1984); Spencer et al.,
Science 262:1019-1024 (1993), both of which are hereby incorporated by reference). This motif has
already been shown to be effective in the localization of reporter genes and can be used to anchor the
zeta chain of the TCR. This motif is placed 5' of the variable region in order to localize the construct to
the plasma membrane. Other modifications such as palmitoylation can be used to anchor constructs
in the plasma membrane; for example, palmitoylation sequences from the G protein-coupled receptor
kinase GRK6 sequence (LLQRLFSRQDCCGNCSDSEEELPTRL, with the bold cysteines being
palmitoylated; Stoffel et al., J. Biol. Chem. 269:27791 (1994)); from rhodopsin
(KQFRNCMLTSLCCGKNPLGD; Barnstable et al., J. Mol. Neurosci. 5(3):207 (1994)); and the p21
H-ras 1 protein (LNPPDESGPGCMSCKCVLS; Capon et al., Nature 302:33 (1983)).

In a preferred embodiment, the targeting sequence is a lysosomal targeting sequence, including, for
example, a lysosomal degradation sequence such as Lamp-2 (KFERQ; Dice, Ann. N.Y. Acad. Sci.
674:58 (1992)); or lysosomal membrane sequences from Lamp-1
(MLIPIAGFFALAGLVLIVLIAYLIGRKRSHAGYQTI; Uthayakumar et al., Cell. Mol. Biol. Res. 41:405
(1995)) or Lamp-2 (LVPIAVGAALAGVLILVLLAYFIGLKHHHAGYEQF; Konecki et al., Biochem.
Biophys. Res. Comm. 205:1-5 (1994)), both of which show the transmembrane domains in italics and
the cytoplasmic targeting signal underlined.

Alternatively, the targeting sequence may be a mitochondrial localization sequence, including
mitochondrial matrix sequences (e.g. yeast alcohol dehydrogenase III;
MLRTSSLFTRRVQPSLFSRNILRLQST; Schatz, Eur. J. Biochem. 165:1-6 (1987)); mitochondrial inner
membrane sequences (yeast cytochrome c oxidase subunit IV; MLSLRQSIRFFKPATRTLCSSRYLL;
Schatz, supra); mitochondrial intermembrane space sequences (yeast cytochrome c1;
MFSMLSKRWAQRTLSKSFYSTATGAASKSGKLTQKLVTAGVAAAGITASTLLYADSLTAEAMTA;
Schatz, supra) or mitochondrial outer membrane sequences (yeast 70 kD outer membrane protein;
MKSFITRNKTAILATVAATGTAIGAYYYYNQLQQQQQRGKK; Schatz, supra).

The target sequences may also be endoplasmic reticulum sequences, including the sequences from
calreticulin (KDEL; Pelham, Royal Society London Transactions B; 1-10 (1992)) or adenovirus E3/19K
protein (LYLSRRSFIDEKKMP; Jackson et al., EMBO J. 9:3153 (1990)).

Furthermore, targeting sequences also include peroxisome sequences (for example, the peroxisome
matrix sequence from Luciferase; SKL; Keller et al., PNAS USA 84:3264 (1987)); farnesylation
sequences (for example, P21 H-ras 1; LNPPDESGPGCMSCKCVLS, with the bold cysteine
farnesylated; Capon, supra); geranylgeranylation sequences (for example, protein rab-5A;
LTEPTQPTRNQCCSN, with the bold cysteines geranylgeranylated; Farnsworth, PNAS USA 91:11963
(1994)); or destruction sequences (cyclin B1; RTALGDIGN; Klotzbucher et al., EMBO J. 15:3053
(1996)).

In a preferred embodiment, the targeting sequence is a secretory signal sequence capable of effecting
the secretion of the fusion polypeptide. There are a large number of known secretory signal
sequences which are placed 5' to the variable peptide region, and are cleaved from the peptide region
to effect secretion into the extracellular space. Secretory signal sequences and their transferability to
unrelated proteins are well known, e.g., Silhavy et al. (1985) Microbiol. Rev. 49, 398-418. This is
particularly useful to generate a peptide capable of binding to the surface of, or affecting the
physiology of, a target cell that is other than the host cell, e.g., the cell infected with the retrovirus. In a
preferred approach, a fusion product is configured to contain, in series, secretion signal peptide-
presentation structure-randomized expression product region-presentation structure. In this manner,
target cells grown in the vicinity of cells caused to express the library of peptides are bathed in
secreted peptide. Target cells exhibiting a physiological change in response to the presence of a
peptide, e.g., by the peptide binding to a surface receptor or by being internalized and binding to
intracellular targets, and the secreting cells are localized by any of a variety of selection schemes and
the peptide causing the effect determined. Exemplary effects include variously that of a designer
cytokine (i.e., a stem cell factor capable of causing hematopoietic stem cells to divide and maintain
their totipotential), a factor causing cancer cells to undergo spontaneous apoptosis, a factor that binds
to the cell surface of target cells and labels them specifically, etc.

Suitable secretory sequences are known, including signals from IL-2 (MYRMQLLSCIALSLALVTNS;
Villinger et al., J. Immunol. 155:3946 (1995)), growth hormone
(MATGSRTSLLLAFGLLCLPWLQEGSAFPT; Roskam et al., Nucleic Acids Res. 7:30 (1979));
preproinsulin (MALWMRLLPLLALLALWGPDPAA AFVN; Bell et al., Nature 284:26 (1980)); and
influenza HA protein (MKAKLLVLLYAFVAGDQI; Sekikawa et al., PNAS 80:3563), with cleavage
between the non-underlined-underlined junction. A particularly preferred secretory signal sequence is
the signal leader sequence from the secreted cytokine IL-4, which comprises the first 24 amino acids
of IL-4 as follows: MGLTSQLLPPLFFLLACAGNFVHG.

In a preferred embodiment, the fusion partner is a rescue sequence. A rescue sequence is a
sequence which may be used to purify or isolate either the peptide or the nucleic acid encoding it.
Thus, for example, peptide rescue sequences include purification sequences such as the His6 tag for
use with Ni affinity columns and epitope tags for detection, immunoprecipitation or FACS
(fluorescence-activated cell sorting). Suitable epitope tags include myc (for use with the commercially
available 9E10 antibody), the BSP biotinylation target sequence of the bacterial enzyme BirA, flu tags,
lacZ, GST, and Strep tag I and II.

Alternatively, the rescue sequence may be a unique oligonucleotide sequence which serves as a 
probe target site to allow the quick and easy isolation of the retroviral construct, via PCR, related 
techniques, or hybridization. 

In a preferred embodiment, the fusion partner is a stability sequence to confer stability to the peptide
or the nucleic acid encoding it. Thus, for example, peptides may be stabilized by the incorporation of
glycines after the initiation methionine (MG or MGGO), for protection of the peptide from ubiquitination as
per Varshavsky's N-End Rule, thus conferring long half-life in the cytoplasm. Similarly, two prolines at
the C-terminus render peptides largely resistant to carboxypeptidase action. The presence of
two glycines prior to the prolines both imparts flexibility and prevents structure-initiating events in the
di-proline from being propagated into the peptide structure. Thus, preferred stability sequences are as
follows: MG(X)nGGPP, where X is any amino acid and n is an integer of at least four.
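
The MG(X)nGGPP layout can be sketched as a small builder and checker. This is an illustration under stated assumptions: X is restricted to the 20 standard one-letter amino acid codes, the example insert is hypothetical, and the function names are not from the disclosure.

```python
import re

# MG(X)nGGPP with n of at least four; X is any of the 20 standard amino acids.
STABILITY = re.compile(r"MG[ACDEFGHIKLMNPQRSTVWY]{4,}GGPP")


def add_stability_flanks(peptide):
    """Wrap a peptide of at least four residues as MG + peptide + GGPP."""
    if len(peptide) < 4:
        raise ValueError("the text calls for n of at least four")
    return "MG" + peptide + "GGPP"


def is_stabilized(seq):
    """Return True if the whole sequence follows the MG(X)nGGPP layout."""
    return STABILITY.fullmatch(seq) is not None


construct = add_stability_flanks("ACDEFHIK")  # hypothetical insert
print(construct, is_stabilized(construct))
```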

The fusion partners may be placed anywhere (i.e. N-terminal, C-terminal, internal) in the structure as
the biology and activity permits. In addition, while the discussion has been directed to the fusion of
fusion partners to the peptide portion of the fusion polypeptide, it is also possible to fuse one or more
of these fusion partners to the p- or rGFP portion of the fusion polypeptide. Thus, for example, the p-
or rGFP may contain a targeting sequence (either N-terminally, C-terminally, or internally, as
described below) at one location, and a rescue sequence in the same place or a different place on the
molecule. Thus, any combination of fusion partners and peptides and p- or rGFP proteins may be
made.

In a preferred embodiment, the fusion partner includes a linker or tethering sequence. Linker
sequences between various targeting sequences (for example, membrane targeting sequences) and
the other components of the constructs (such as the randomized peptides) may be desirable to allow
the peptides to interact with potential targets unhindered. For example, useful linkers include glycine
polymers (G)n, glycine-serine polymers (including, for example, (GS)n, (GSGGS)n and (GGGS)n, where
n is an integer of at least one), glycine-alanine polymers, alanine-serine polymers, and other flexible
linkers such as the tether for the shaker potassium channel, and a large variety of other flexible
linkers, as will be appreciated by those in the art. Glycine and glycine-serine polymers are preferred
since both of these amino acids are relatively unstructured, and therefore may be able to serve as a
neutral tether between components. Glycine polymers are the most preferred as glycine accesses
significantly more phi-psi space than even alanine, and is much less restricted than residues with longer
side chains (see Scheraga, Rev. Computational Chem. III:73-142 (1992)). Secondly, serine is
hydrophilic and therefore able to solubilize what could be a globular glycine chain. Third, similar
chains have been shown to be effective in joining subunits of recombinant proteins such as single
chain antibodies.
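
The repeat-unit linkers listed above lend themselves to a short sketch. The helper names, the default unit and the placeholder flanking sequences are assumptions made for illustration only.

```python
def linker(unit, n):
    """Repeat a linker unit n times, e.g. linker("GGGS", 3)."""
    if n < 1:
        raise ValueError("n is an integer of at least one")
    return unit * n


def join_with_linker(left, right, unit="GS", n=2):
    """Join two components with a flexible linker placed between them."""
    return left + linker(unit, n) + right


# Units named in the text: G, GS, GSGGS and GGGS.
for unit in ("G", "GS", "GSGGS", "GGGS"):
    print(unit, linker(unit, 3))
```

A call such as `join_with_linker(targeting, peptide, unit="GGGS", n=2)` would place an eight-residue tether between a targeting sequence and a randomized peptide, where `targeting` and `peptide` are whatever one-letter sequences the construct calls for.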

In a preferred embodiment, the peptide is connected to the p- or rGFP via linkers. That is, while one
embodiment utilizes the direct linkage of the peptide to the rGFP or pGFP or of the peptide and any
fusion partners to the p- or rGFP protein, a preferred embodiment utilizes linkers at one or both ends
of the peptide. That is, when attached either to the N- or C-terminus, one linker may be used. When
the peptide is inserted in an internal position, as is generally outlined below, preferred embodiments
utilize at least one linker and preferably two, one at each terminus of the peptide. Linkers are
generally preferred in order to conformationally decouple any insertion sequence (i.e. the peptide)
from the scaffold structure itself, to minimize local distortions in the scaffold structure that can either
destabilize folding intermediates or allow access to p- or rGFP's buried tripeptide fluorophore, which
decreases (or eliminates) p- or rGFP's fluorescence due to exposure to exogenous collisional
fluorescence quenchers (see Phillips, Curr. Opin. Structural Biology 7:821 (1997), hereby incorporated
by reference in its entirety).

Accordingly, as outlined below, when the peptides are inserted into internal positions in the p- or rGFP
protein, preferred embodiments utilize linkers, and preferably (gly)n linkers, where n is 1 or more, with
n being two, three, four, five or six preferred, although linkers of 7-10 or more amino acids are also
possible. Generally in this embodiment, no amino acids with β-carbons are used in the linkers.

In addition, the fusion partners, including presentation structures, may be modified, randomized,
and/or matured to alter the presentation orientation of the randomized expression product. For
example, determinants at the base of the loop may be modified to slightly modify the internal loop
peptide tertiary structure, while maintaining the randomized amino acid sequence.

In a preferred embodiment, combinations of fusion partners are used. Thus, for example, any number
of combinations of presentation structures, targeting sequences, rescue sequences, and stability
sequences may be used, with or without linker sequences. As will be appreciated by those in the art,
using a base vector that contains a cloning site for receiving random and/or biased libraries, one can
cassette in various fusion partners 5' and 3' of the library. In addition, as discussed herein, it is
possible to have more than one variable region in a construct, either to together form a new surface or
to bring two other molecules together. Similarly, as more fully outlined below, it is possible to have
peptides inserted at two or more different loops of the p- or rGFP protein, preferably but not required
to be on the same "face" of the p- or rGFP protein.

The invention further provides fusion nucleic acids encoding the fusion polypeptides of the invention.
As will be appreciated by those in the art, due to the degeneracy of the genetic code, an extremely
large number of nucleic acids may be made, all of which encode the fusion proteins of the present
invention. Thus, having identified a particular amino acid sequence, those skilled in the art could
make any number of different nucleic acids, by simply modifying the sequence of one or more codons
in a way which does not change the amino acid sequence of the fusion protein.
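
The scale of this degeneracy can be illustrated by multiplying the number of synonymous codons at each position of a peptide. The sketch assumes the standard genetic code and ignores start and stop codons; the function name is illustrative, and the SV40 NLS is used only as a convenient short example.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "M": 1, "W": 1,
}


def encoding_count(peptide):
    """Number of distinct coding sequences for a peptide given in one-letter code."""
    return prod(CODON_COUNTS[res] for res in peptide)


# The seven-residue SV40 NLS alone can be encoded 1536 ways.
print(encoding_count("PKKKRKV"))  # 1536
```

Because the count is a product over positions, it grows roughly exponentially with length, which is why a full-length fusion protein corresponds to an extremely large number of nucleic acids.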

The present invention has specifically contemplated each and every possible variation of
polynucleotide that could be made by selecting combinations based on the possible codon choices,
and all such variations are to be considered specifically disclosed and equivalent to the sequences of
Figures 2 and 3. Codons are preferably selected to fit the host cell in which the enzyme is being
produced; that is, codon usage for yeast is used to express in yeast; codon usage for mammalian
cells is used to express in mammalian cells; etc. Selection of codons to maximize expression of
proteins in a heterologous host is a known technique.

Using the nucleic acids of the present invention which encode a fusion protein, a variety of expression
vectors are made. The expression vectors may be either self-replicating extrachromosomal vectors or
vectors which integrate into a host genome. Generally, these expression vectors include
transcriptional and translational regulatory nucleic acid operably linked to the nucleic acid encoding the
fusion protein. The term "control sequences" refers to DNA sequences necessary for the expression
of an operably linked coding sequence in a particular host organism. The control sequences that are
suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and a
ribosome binding site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, and
enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA
for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide;
a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the
sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to
facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are
contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However,
enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in
accordance with conventional practice. The transcriptional and translational regulatory nucleic acid
will generally be appropriate to the host cell used to express the fusion protein; for example,
transcriptional and translational regulatory nucleic acid sequences from Bacillus are preferably used to
express the fusion protein in Bacillus. Numerous types of appropriate expression vectors, and
suitable regulatory sequences, are known in the art for a variety of host cells.

In general, the transcriptional and translational regulatory sequences may include, but are not limited
to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences,
translational start and stop sequences, and enhancer or activator sequences. In a preferred
embodiment, the regulatory sequences include a promoter and transcriptional start and stop
sequences.

Promoter sequences encode either constitutive or inducible promoters. The promoters may be either
naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine elements of
more than one promoter, are also known in the art, and are useful in the present invention. In a
preferred embodiment, the promoters are strong promoters, allowing high expression in cells,
particularly mammalian cells, such as the CMV promoter, particularly in combination with a Tet
regulatory element.

In addition, the expression vector may comprise additional elements. For example, the expression
vector may have two replication systems, thus allowing it to be maintained in two organisms, for
example in mammalian or insect cells for expression and in a procaryotic host for cloning and
amplification. Furthermore, for integrating expression vectors, the expression vector contains at least
one sequence homologous to the host cell genome, and preferably two homologous sequences which
flank the expression construct. The integrating vector may be directed to a specific locus in the host
cell by selecting the appropriate homologous sequence for inclusion in the vector. Constructs for
integrating vectors are well known in the art.

In addition, in a preferred embodiment, the expression vector contains a selectable marker gene to
allow the selection of transformed host cells. Selection genes are well known in the art and will vary
with the host cell used.

A preferred expression vector system is a retroviral vector system such as is generally described in
PCT/US97/01019 and PCT/US97/01048, both of which are hereby expressly incorporated by
reference. Preferred retroviral systems and constructs are also outlined below.

The fusion nucleic acids are introduced into the cells for screening, as is more fully outlined below. By
"introduced into" or grammatical equivalents herein is meant that the nucleic acids enter the cells in a
manner suitable for subsequent expression of the nucleic acid. The method of introduction is largely
dictated by the targeted cell type, discussed below. Exemplary methods include CaPO4 precipitation,
liposome fusion, lipofectin®, electroporation, viral infection, etc. The candidate nucleic acids may
stably integrate into the genome of the host cell (for example, with retroviral introduction, outlined
below), or may exist either transiently or stably in the cytoplasm (i.e. through the use of traditional
plasmids, utilizing standard regulatory sequences, selection markers, etc.). As many pharmaceutically
important screens require human or model mammalian cell targets, retroviral vectors capable of
transfecting such targets are preferred.

The fusion proteins of the present invention are produced by culturing a host cell transformed with an
expression vector containing nucleic acid encoding a fusion protein, under the appropriate conditions
to induce or cause expression of the fusion protein. The conditions appropriate for fusion protein
expression will vary with the choice of the expression vector and the host cell, and will be easily
ascertained by one skilled in the art through routine experimentation. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and proliferation of
the host cell, while the use of an inducible promoter requires the appropriate growth conditions for
induction. In addition, in some embodiments, the timing of the harvest is important. For example, the
baculoviral systems used in insect cell expression are lytic viruses, and thus harvest time selection
can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and animal cells,
including mammalian cells. Of particular interest are Drosophila melanogaster cells, Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, SF9 cells, C129 cells, 293 cells, Neurospora,
BHK, CHO, COS, and HeLa cells, fibroblasts, Schwannoma cell lines, immortalized mammalian
myeloid and lymphoid cell lines, Jurkat cells, mast cells and other endocrine and exocrine cells, and
neuronal cells.

In a preferred embodiment, the fusion proteins are expressed in mammalian cells. Mammalian
expression systems are also known in the art, and include retroviral systems. A mammalian promoter
is any DNA sequence capable of binding mammalian RNA polymerase and initiating the downstream
(3') transcription of a coding sequence for the fusion protein into mRNA. A promoter will have a
transcription initiating region, which is usually placed proximal to the 5' end of the coding sequence,
and a TATA box, usually located 25-30 base pairs upstream of the transcription initiation site. The
TATA box is thought to direct RNA polymerase II to begin RNA synthesis at the correct site. A
mammalian promoter will also contain an upstream promoter element (enhancer element), typically
located within 100 to 200 base pairs upstream of the TATA box. An upstream promoter element
determines the rate at which transcription is initiated and can act in either orientation. Of particular
use as mammalian promoters are the promoters from mammalian viral genes, since the viral genes
are often highly expressed and have a broad host range. Examples include the SV40 early promoter,
mouse mammary tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus
promoter, and the CMV promoter.

Typically, transcription termination and polyadenylation sequences recognized by mammalian cells are
regulatory regions located 3' to the translation stop codon and thus, together with the promoter
elements, flank the coding sequence. The 3' terminus of the mature mRNA is formed by site-specific
post-transcriptional cleavage and polyadenylation. Examples of transcription terminator and
polyadenylation signals include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other hosts, are
well known in the art, and will vary with the host cell used. Techniques include dextran-mediated
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion,
electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, and direct
microinjection of the DNA into nuclei. As outlined herein, a particularly preferred method utilizes
retroviral infection, as outlined in PCT/US97/01019, incorporated by reference.

As will be appreciated by those in the art, the type of mammalian cells used in the present invention
can vary widely. Basically, any mammalian cells may be used, with mouse, rat, primate and human
cells being particularly preferred, although as will be appreciated by those in the art, modifications of
the system by pseudotyping allow all eukaryotic cells to be used, preferably higher eukaryotes. As is
more fully described below, a screen will be set up such that the cells exhibit a selectable phenotype in
the presence of a bioactive peptide. As is more fully described below, cell types implicated in a wide
variety of disease conditions are particularly useful, so long as a suitable screen may be designed to
allow the selection of cells that exhibit an altered phenotype as a consequence of the presence of a
peptide within the cell.

Accordingly, suitable cell types include, but are not limited to, tumor cells of all types (particularly
melanoma, myeloid leukemia, carcinomas of the lung, breast, ovaries, colon, kidney, prostate,
pancreas and testes), cardiomyocytes, endothelial cells, epithelial cells, lymphocytes (T-cell and B
cell), mast cells, eosinophils, vascular intimal cells, hepatocytes, leukocytes including mononuclear
leukocytes, stem cells such as haemopoietic, neural, skin, lung, kidney, liver and myocyte stem cells
(for use in screening for differentiation and de-differentiation factors), osteoclasts, chondrocytes and
other connective tissue cells, keratinocytes, melanocytes, liver cells, kidney cells, and adipocytes.
Suitable cells also include known research cells, including, but not limited to, Jurkat T cells, NIH3T3
cells, CHO, Cos, etc. See the ATCC cell line catalog, hereby expressly incorporated by reference.

In one embodiment, the cells may be additionally genetically engineered, that is, contain exogenous
nucleic acid other than the fusion nucleic acid.

In a preferred embodiment, the fusion proteins are expressed in bacterial systems. Bacterial
expression systems are well known in the art.

A suitable bacterial promoter is any nucleic acid sequence capable of binding bacterial RNA
polymerase and initiating the downstream (3') transcription of the coding sequence of the fusion
protein into mRNA. A bacterial promoter has a transcription initiation region which is usually placed
proximal to the 5' end of the coding sequence. This transcription initiation region typically includes an
RNA polymerase binding site and a transcription initiation site. Sequences encoding metabolic
pathway enzymes provide particularly useful promoter sequences. Examples include promoter
sequences derived from sugar metabolizing enzymes, such as galactose, lactose and maltose, and
sequences derived from biosynthetic enzymes such as tryptophan. Promoters from bacteriophage
may also be used and are known in the art. In addition, synthetic promoters and hybrid promoters are
also useful; for example, the tac promoter is a hybrid of the trp and lac promoter sequences.
Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin
that have the ability to bind bacterial RNA polymerase and initiate transcription.

In addition to a functioning promoter sequence, an efficient ribosome binding site is desirable. In E.
coli, the ribosome binding site is called the Shine-Dalgarno (SD) sequence and includes an initiation
codon and a sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the initiation
codon.
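
The spacing rule for the ribosome binding site can be checked computationally. The sketch below
looks for a fragment of an SD-like motif at least 3 nucleotides long that ends 3 to 11 nucleotides
upstream of a given initiation codon. The consensus core AGGAGG, the function name, and the
convention that the spacing is counted as the number of nucleotides between the last base of the
motif and the first base of the initiation codon are assumptions made for this illustration; they are not
specified in the text.

```python
SD_CONSENSUS = "AGGAGG"  # assumed canonical core; not stated in the specification


def find_sd(seq, start_codon_index, min_len=3, min_gap=3, max_gap=11):
    """Return (motif, gap) for the longest consensus fragment found upstream, else None.

    gap is the number of nucleotides between the end of the motif and the
    first base of the initiation codon (an assumed convention).
    """
    # All fragments of the consensus with at least min_len bases, longest first.
    fragments = sorted(
        {
            SD_CONSENSUS[i:j]
            for i in range(len(SD_CONSENSUS))
            for j in range(i + min_len, len(SD_CONSENSUS) + 1)
        },
        key=lambda frag: (-len(frag), frag),
    )
    for frag in fragments:
        for gap in range(min_gap, max_gap + 1):
            end = start_codon_index - gap  # index just past the candidate motif
            begin = end - len(frag)
            if begin >= 0 and seq[begin:end] == frag:
                return frag, gap
    return None


# Hypothetical leader: TT + AGGAGG + 6-nt spacer + ATG + GCT
leader = "TTAGGAGGTTCAACATGGCT"
hit = find_sd(leader, leader.find("ATG"))
```

A result of None indicates that no SD-like fragment lies within the stated window, which may suggest
that the ribosome binding site of a construct should be redesigned.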

The expression vector may also include a signal peptide sequence that provides for secretion of the
fusion protein in bacteria. The signal sequence typically encodes a signal peptide comprised of
hydrophobic amino acids which direct the secretion of the protein from the cell, as is well known in the
art. The protein is either secreted into the growth media (gram-positive bacteria) or into the
periplasmic space, located between the inner and outer membrane of the cell (gram-negative
bacteria).

The bacterial expression vector may also include a selectable marker gene to allow for the selection of
bacterial strains that have been transformed. Suitable selection genes include genes which render the
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin
and tetracycline. Selectable markers also include biosynthetic genes, such as those in the histidine,
tryptophan and leucine biosynthetic pathways.



These components are assembled into expression vectors. Expression vectors for bacteria are well
known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and
Streptococcus lividans, among others.

The bacterial expression vectors are transformed into bacterial host cells using techniques well known
in the art, such as calcium chloride treatment, electroporation, and others.

In one embodiment, fusion proteins are produced in insect cells. Expression vectors for the
transformation of insect cells, and in particular, baculovirus-based expression vectors, are well known
in the art.



In a preferred embodiment, fusion protein is produced in yeast cells. Yeast expression systems are
well known in the art, and include expression vectors for Saccharomyces cerevisiae, Candida albicans
and C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and
P. pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. Preferred promoter sequences for
expression in yeast include the inducible GAL1,10 promoter, the promoters from alcohol
dehydrogenase, enolase, glucokinase, glucose-6-phosphate isomerase,
glyceraldehyde-3-phosphate-dehydrogenase, hexokinase, phosphofructokinase, 3-phosphoglycerate
mutase, pyruvate kinase, and the acid phosphatase gene. Yeast selectable markers include ADE2,
HIS4, LEU2, TRP1, and ALG7, which confers resistance to tunicamycin; the neomycin
phosphotransferase gene, which confers resistance to G418; and the CUP1 gene, which allows yeast
to grow in the presence of copper ions.

In addition, the fusion polypeptides of the invention may be further fused to other proteins, if desired,
for example to increase expression.

In one embodiment, the fusion nucleic acids, proteins and antibodies of the invention are labeled with
a label other than the p- or rGFP protein. By "labeled" herein is meant that a compound has at least
one element, isotope or chemical compound attached to enable the detection of the compound. In
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy isotopes;
b) immune labels, which may be antibodies or antigens; and c) colored or fluorescent dyes. The
labels may be incorporated into the compound at any position.

In a preferred embodiment, the fusion nucleic acids are introduced into the cells to screen for peptides
capable of altering the phenotype of a cell.

In a preferred embodiment, a first plurality of cells is screened. That is, the cells into which the fusion
nucleic acids are introduced are screened for an altered phenotype. Thus, in this embodiment, the
effect of the bioactive peptide is seen in the same cells in which it is made; i.e. an autocrine effect.

By a "plurality of cells" herein is meant roughly from about 10' cells to 10» or 10», with from 10* to 10»
being preferred. This plurality of cells comprises a cellular library, wherein generally each cell within
the library contains a member of the peptide molecular library, i.e. a different peptide (or nucleic acid
encoding the peptide), although as will be appreciated by those in the art, some cells within the library
may not contain a peptide, and some may contain more than one species of peptide. When methods
other than retroviral infection are used to introduce the candidate nucleic acids into a plurality of cells,
the distribution of candidate nucleic acids within the individual cell members of the cellular library may
vary widely, as it is generally difficult to control the number of nucleic acids which enter a cell during
electroporation, etc. Thus, in a preferred embodiment, libraries of fusion polypeptides comprising p-
or rGFP proteins and random peptides are made; that is, a library of random peptides is used to
generate a library of fusion polypeptides (and thus a library of fusion polynucleotides encoding the
fusion polypeptides).
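
The observation that some cells receive no library member while others receive more than one can
be made quantitative under a simple model. The sketch below assumes, purely for illustration, that
library members enter cells independently so that the number per cell follows a Poisson distribution
with a chosen mean; neither this model nor the example mean values are taken from the
specification, and the function name is hypothetical.

```python
import math


def occupancy(mean_per_cell):
    """Fractions of cells with zero, exactly one, or several library members.

    Assumes a Poisson distribution of library members per cell (an assumption
    of this sketch, not a statement from the specification).
    """
    none = math.exp(-mean_per_cell)   # P(0) = e^-m
    one = mean_per_cell * none        # P(1) = m * e^-m
    return {"none": none, "one": one, "multiple": 1.0 - none - one}


# Illustrative mean values only.
for mean in (0.1, 0.5, 1.0):
    fractions = occupancy(mean)
    print(mean, {key: round(value, 3) for key, value in fractions.items()})
```

Under this model a low mean keeps most expressing cells to a single peptide species at the cost of
leaving many cells empty, which is one reason expression of the fusion protein is verified before a
phenotype is attributed to a peptide.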

In a preferred embodiment, the fusion nucleic acids are introduced into a first plurality of cells, and the
effect of the peptide is screened in a second or third plurality of cells, different from the first plurality of
cells, i.e. generally a different cell type. That is, the effect of the bioactive peptide is due to an
extracellular effect on a second cell; i.e. an endocrine or paracrine effect. This is done using standard
techniques. The first plurality of cells may be grown in or on one media, and the media is allowed to
touch a second plurality of cells, and the effect measured. Alternatively, there may be direct contact
between the cells. Thus, "contacting" is functional contact, and includes both direct and indirect. In
this embodiment, the first plurality of cells may or may not be screened.

If necessary, the cells are treated to conditions suitable for the expression of the peptide (for example,
when inducible promoters are used).

Thus, the methods of the present invention comprise introducing a molecular library of fusion nucleic
acids encoding randomized peptides fused to scaffold into a plurality of cells, a cellular library. Each
of the nucleic acids comprises a different nucleotide sequence encoding scaffold with a random
peptide. The plurality of cells is then screened, as is more fully outlined below, for a cell exhibiting an
altered phenotype. The altered phenotype is due to the presence of a bioactive peptide.

By "altered phenotype" or "changed physiology" or other grammatical equivalents herein is meant that
the phenotype of the cell is altered in some way, preferably in some detectable and/or measurable
way. As will be appreciated in the art, a strength of the present invention is the wide variety of cell
types and potential phenotypic changes which may be tested using the present methods. Accordingly,
any phenotypic change which may be observed, detected, or measured may be the basis of the
screening methods herein. Suitable phenotypic changes include, but are not limited to: gross physical
changes such as changes in cell morphology, cell growth, cell viability, adhesion to substrates or other
cells, and cellular density; changes in the expression of one or more RNAs, proteins, lipids, hormones,
cytokines, or other molecules; changes in the equilibrium state (i.e. half-life) of one or more RNAs,
proteins, lipids, hormones, cytokines, or other molecules; changes in the localization of one or more
RNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the bioactivity or specific
activity of one or more RNAs, proteins, lipids, hormones, cytokines, receptors, or other molecules;
changes in the secretion of ions, cytokines, hormones, growth factors, or other molecules; alterations
in cellular membrane potentials, polarization, integrity or transport; changes in infectivity,
susceptibility, latency, adhesion, and uptake of viruses and bacterial pathogens; etc. By "capable of
altering the phenotype" herein is meant that the bioactive peptide can change the phenotype of the cell
in some detectable and/or measurable way.

The altered phenotype may be detected in a wide variety of ways, as is described more fully below,
and will generally depend on and correspond to the phenotype that is being changed. Generally, the
changed phenotype is detected using, for example: microscopic analysis of cell morphology; standard
cell viability assays, including both increased cell death and increased cell viability, for example, cells
that are now resistant to cell death via virus, bacteria, or bacterial or synthetic toxins; standard labeling
assays such as fluorometric indicator assays for the presence or level of a particular cell or molecule,
including FACS or other dye staining techniques; biochemical detection of the expression of target
compounds after killing the cells; etc. In some cases, as is more fully described herein, the altered
phenotype is detected in the cell in which the fusion nucleic acid was introduced; in other
embodiments, the altered phenotype is detected in a second cell which is responding to some
molecular signal from the first cell.

An altered phenotype of a cell indicates the presence of a bioactive peptide, acting preferably in a
transdominant way. By "transdominant" herein is meant that the bioactive peptide indirectly causes the
altered phenotype by acting on a second molecule, which leads to an altered phenotype. That is, a
transdominant expression product has an effect that is not in cis, i.e., a trans event as defined in
genetic terms or biochemical terms. A transdominant effect is a distinguishable effect by a molecular
entity (i.e., the encoded peptide or RNA) upon some separate and distinguishable target; that is, not
an effect upon the encoded entity itself. As such, transdominant effects include many well-known
effects by pharmacologic agents upon target molecules or pathways in cells or physiologic systems;
for instance, the β-lactam antibiotics have a transdominant effect upon peptidoglycan synthesis in
bacterial cells by binding to penicillin binding proteins and disrupting their functions. An exemplary
transdominant effect by a peptide is the ability to inhibit NF-κB signaling by binding to IκB-α at a
region critical for its function, such that in the presence of sufficient amounts of the peptide (or
molecular entity), the signaling pathways that normally lead to the activation of NF-κB through
phosphorylation and/or degradation of IκB-α are inhibited from acting at IκB-α because of the binding
of the peptide or molecular entity. In another instance, signaling pathways that are normally activated
to secrete IgE are inhibited in the presence of peptide. Or, signaling pathways in adipose tissue cells,
normally quiescent, are activated to metabolize fat. Or, in the presence of a peptide, intracellular
mechanisms for the replication of certain viruses, such as HIV-1, or Herpesviridae family members, or
Respiratory Syncytial Virus, for example, are inhibited.

A transdominant effect upon a protein or molecular pathway is clearly distinguishable from
randomization, change, or mutation of a sequence within a protein or molecule of known or unknown
function to enhance or diminish a biochemical ability that protein or molecule already manifests. For
instance, a protein that enzymatically cleaves β-lactam antibiotics, a β-lactamase, could be enhanced
or diminished in its activity by mutating sequences internal to its structure that enhance or diminish the
ability of this enzyme to act upon and cleave β-lactam antibiotics. This would be called a cis mutation
to the protein. The effect of this protein upon β-lactam antibiotics is an activity the protein already
manifests, to a distinguishable degree. Similarly, a mutation in the leader sequence that enhanced the
export of this protein to the extracellular spaces wherein it might encounter β-lactam molecules more
readily, or a mutation within the sequence that enhances the stability of the protein, would be termed
cis mutations in the protein. For comparison, a transdominant effector of this protein would include an
agent, independent of the β-lactamase, that bound to the β-lactamase in such a way that it enhanced
or diminished the function of the β-lactamase by virtue of its binding to β-lactamase.

In a preferred embodiment, once a cell with an altered phenotype is detected, the presence of the
fusion protein is verified, to ensure that the peptide was expressed and thus that the altered phenotype
can be due to the presence of the peptide. As will be appreciated by those in the art, this verification
of the presence of the peptide can be done either before, during or after the screening for an altered
phenotype. This can be done in a variety of ways, although preferred methods utilize FACS
techniques.

Once the presence of the fusion protein is verified, the cell with the altered phenotype is generally
isolated from the plurality which do not have altered phenotypes. This may be done in any number of
ways, as is known in the art, and will in some instances depend on the assay or screen. Suitable
isolation techniques include, but are not limited to, FACS, lysis selection using complement, cell
cloning, scanning by Fluorimager, expression of a "survival" protein, induced expression of a cell
surface protein or other molecule that can be rendered fluorescent or taggable for physical isolation;
expression of an enzyme that changes a non-fluorescent molecule to a fluorescent one; overgrowth
against a background of no or slow growth; death of cells and isolation of DNA or other cell vitality
indicator dyes, etc.

In a preferred embodiment, the fusion nucleic acid and/or the bioactive peptide (i.e. the fusion protein)
is isolated from the positive cell. This may be done in a number of ways. In a preferred embodiment,
primers complementary to DNA regions common to the retroviral constructs, or to specific
components of the library such as a rescue sequence, defined above, are used to "rescue" the unique
random sequence. Alternatively, the fusion protein is isolated using a rescue sequence. Thus, for
example, rescue sequences comprising epitope tags or purification sequences may be used to pull
out the fusion protein using immunoprecipitation or affinity columns. In some instances, as is outlined
below, this may also pull out the primary target molecule, if there is a sufficiently strong binding
interaction between the bioactive peptide and the target molecule. Alternatively, the peptide may be
detected using mass spectroscopy.
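
The primer-based rescue described above relies on constant regions that flank the unique random
sequence. As an in silico analogue only, the sketch below extracts the variable insert lying between
two constant flanking sequences in a sequenced construct. The flanking sequences, the example
construct and the function name are hypothetical and do not correspond to any construct of the
specification.

```python
def rescue_insert(construct, left_flank, right_flank):
    """Return the sequence between the two constant flanks, or None if either is absent."""
    start = construct.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)                 # first base after the left constant region
    end = construct.find(right_flank, start)
    if end == -1:
        return None
    return construct[start:end]


# Hypothetical constant regions and construct, for illustration only.
LEFT = "GGATCCACC"
RIGHT = "TAAGAATTC"
construct = "AAAA" + LEFT + "ATGGCTAAA" + RIGHT + "TTTT"
insert = rescue_insert(construct, LEFT, RIGHT)
```

The recovered insert can then be translated or compared across positive cells to determine which
random sequences recur.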

Once rescued, the sequence of the bioactive peptide and/or fusion nucleic acid is determined. This
information can then be used in a number of ways.

In a preferred embodiment, the bioactive peptide is resynthesized and reintroduced into the target
cells, to verify the effect. This may be done using retroviruses, or alternatively using fusions to the
HIV-1 Tat protein, and analogs and related proteins, which allows very high uptake into target cells.
See for example, Fawell et al., PNAS USA 91:664 (1994); Frankel et al., Cell 65:1189 (1988); Savion
et al., J. Biol. Chem. 256:1149 (1981); Derossi et al., J. Biol. Chem. 269:10444 (1994); and Baldin et
al., EMBO J. 9:1511 (1990), all of which are incorporated by reference.

In a preferred embodiment, the sequence of a bioactive peptide is used to generate more candidate
peptides. For example, the sequence of the bioactive peptide may be the basis of a second round of
(biased) randomization, to develop bioactive peptides with increased or altered activities.
Alternatively, the second round of randomization may change the affinity of the bioactive peptide.
Furthermore, it may be desirable to put the identified random region of the bioactive peptide into other
presentation structures, or to alter the sequence of the constant region of the presentation structure, to
alter the conformation/shape of the bioactive peptide. It may also be desirable to "walk" around a
potential binding site, in a manner similar to the mutagenesis of a binding pocket, by keeping one end
of the ligand region constant and randomizing the other end to shift the binding of the peptide around.
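
A second round of biased randomization, including the "walking" approach in which one end of the
peptide is held constant, can be sketched as follows. This is a simplified peptide-level illustration
only: in practice the bias would ordinarily be introduced at the nucleic acid level, and the parent
peptide, the retention probability and the function name used here are hypothetical.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the twenty standard amino acids


def biased_library(parent, n_variants, keep=0.7, fixed=(), seed=0):
    """Generate variants of `parent` that are biased toward the parent sequence.

    Each position retains the parent residue with probability `keep` and is
    otherwise replaced by a uniformly chosen amino acid. Positions listed in
    `fixed` are never changed, which allows one end to be held constant.
    """
    rng = random.Random(seed)  # seeded so that a library can be regenerated
    fixed = set(fixed)
    variants = []
    for _ in range(n_variants):
        residues = [
            aa if (i in fixed or rng.random() < keep) else rng.choice(AMINO_ACIDS)
            for i, aa in enumerate(parent)
        ]
        variants.append("".join(residues))
    return variants


# Hypothetical 10-residue parent; hold the first five residues constant.
parent = "ACDEFGHIKL"
library = biased_library(parent, n_variants=50, keep=0.5, fixed=range(5))
```

Raising `keep` concentrates the library near the identified peptide, while lowering it explores
sequences further from the parent.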

In a preferred embodiment, either the bioactive peptide or the bioactive nucleic acid encoding it is
used to identify target molecules, i.e. the molecules with which the bioactive peptide interacts. As will
be appreciated by those in the art, there may be primary target molecules, to which the bioactive
peptide binds or acts upon directly, and there may be secondary target molecules, which are part of
the signalling pathway affected by the bioactive peptide; these might be termed "validated targets".

In a preferred embodiment, the bioactive peptide is used to pull out target molecules. For example, as
outlined herein, if the target molecules are proteins, the use of epitope tags or purification sequences
can allow the purification of primary target molecules via biochemical means (co-immunoprecipitation,
affinity columns, etc.). Alternatively, the peptide, when expressed in bacteria and purified, can be
used as a probe against a bacterial cDNA expression library made from mRNA of the target cell type.
Or, peptides can be used as "bait" in either yeast or mammalian two or three hybrid systems. Such
interaction cloning approaches have been very useful to isolate DNA-binding proteins and other
interacting protein components. The peptide(s) can be combined with other pharmacologic activators
to study the epistatic relationships of signal transduction pathways in question. It is also possible to
synthetically prepare labeled peptide and use it to screen a cDNA library expressed in bacteriophage
for those cDNAs which bind the peptide. Furthermore, it is also possible that one could use cDNA
cloning via retroviral libraries to "complement" the effect induced by the peptide. In such a strategy,
the peptide would be required to be stoichiometrically titrating away some important factor for a
specific signaling pathway. If this molecule or activity is replenished by over-expression of a cDNA
from within a cDNA library, then one can clone the target. Similarly, cDNAs cloned by any of the
above yeast or bacteriophage systems can be reintroduced to mammalian cells in this manner to
confirm that they act to complement function in the system the peptide acts upon.

Once primary target molecules have been identified, secondary target molecules may be identified in
the same manner, using the primary target as the "bait". In this manner, signalling pathways may be
elucidated. Similarly, bioactive peptides specific for secondary target molecules may also be
discovered, to allow a number of bioactive peptides to act on a single pathway, for example for
combination therapies.

The screening methods of the present invention may be useful to screen a large number of cell types
under a wide variety of conditions. Generally, the host cells are cells that are involved in disease
states, and they are tested or screened under conditions that normally result in undesirable
consequences on the cells. When a suitable bioactive peptide is found, the undesirable effect may be
reduced or eliminated. Alternatively, normally desirable consequences may be reduced or eliminated,
with an eye towards elucidating the cellular mechanisms associated with the disease state or
signalling pathway.

In this way, fusion polypeptides comprising p- or rGFP proteins and random peptides are made for
screening of the random peptides for bioactivity.

Alternatively, the present invention provides additional fusion constructs incorporating p- or rGFP.
However, in this embodiment, the p- or rGFP protein is fused, in a number of ways as are described
herein, to a gene or regulatory element of interest.

In a preferred embodiment, the p- or rGFP can be used to evaluate, test and screen promoters. Thus,
in this embodiment, the invention provides compositions comprising a promoter of interest and a gene
encoding a p- or rGFP. Preferably the promoter is not the native p- or rGFP promoter.

In a preferred embodiment, the invention relates to methods that rely on p- or rGFP genes fused to
IgE promoters, such as the IL-4 inducible ε promoter that starts a cascade that ultimately results in
IgE production, as is generally described in U.S.S.N. 09/076,624, hereby incorporated by reference in
its entirety. Using novel reporter constructs, screening for modulators of this promoter system may be
done, which can be used to screen for upstream modulators of IgE production, to prevent the
production of IgE and thus reduce or eliminate an allergic response. For example, an early step in the
Ig switch is the production of sterile ε transcripts in response to IL-4. It is also appreciated that
blockage of the production of membrane bound IgE may induce programmed cell death (PCD). By
interfering at this step, highly efficient, rapid and prolonged inhibition of the allergic response may
occur. In addition, these techniques allow individual cell assessment and thus are useful for
high-throughput screening strategies, for example those that utilize fluorescence activated cell sorting
(FACS) techniques, and thus allow screening of large numbers of compounds for their effects on IgE
production.



Thus, in a preferred embodiment, the invention provides a number of different constructs that allow for
screening for antagonists and agonists of these promoters.

In a preferred embodiment, the invention provides methods of screening for bioactive agents capable
of modulating, particularly inhibiting, an IL-4 inducible ε promoter. By "an IL-4 inducible promoter"
herein is meant a nucleic acid promoter that is induced by IL-4, putatively by binding an unknown IL-4
induced DNA binding protein that results in induction of the promoter; that is, the introduction of IL-4
causes the pronounced activation of a particular DNA binding protein that then binds to the IL-4
inducible promoter segment and induces transcription. The sequence of the human IL-4 inducible
promoter is shown in Figure 1 of U.S.S.N. 09/076,624, hereby expressly incorporated by reference in
its entirety, and as will be appreciated by those in the art, derivatives or mutant promoters are included
within this definition. Particularly included within the definition of an IL-4 inducible promoter are
fragments or deletions of the sequence shown in Figure 1 of U.S.S.N. 09/076,624. As is known in the
art, the IL-4 inducible promoter is also inducible by IL-13. By "modulating an IL-4 inducible promoter"
herein is meant either an increase or a decrease (inhibition) of promoter activity, for example as
measured by the presence or quantification of transcripts or of translation products. By "inhibiting an
IL-4 inducible promoter" herein is meant a decrease in promoter activity, with changes of at least
about 50% being preferred, and at least about 90% being particularly preferred.
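
As a rough illustration of the definition above, inhibition can be expressed as the percent decrease in
reporter signal relative to an induced, untreated control. The Python sketch below is not part of the
disclosed methods: the function names, the background-subtraction step, and the choice of a single
fluorescence reading as the measure of promoter activity are assumptions made for illustration, and the
"about 50%" and "about 90%" tiers are treated as exact cutoffs only for simplicity.

```python
def percent_inhibition(control_signal, treated_signal, background=0.0):
    """Percent decrease in reporter signal relative to the induced control.

    All arguments are hypothetical fluorescence readings in the same units.
    """
    control = control_signal - background
    treated = treated_signal - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (control - treated) / control


def classify_inhibition(pct):
    """Map a percent decrease onto the preference tiers stated in the text.

    The text says "at least about" 50% and 90%; exact cutoffs are an assumption.
    """
    if pct >= 90.0:
        return "particularly preferred"
    if pct >= 50.0:
        return "preferred"
    return "below preferred threshold"
```

For example, a control reading of 1000 and a treated reading of 400 (no background) corresponds to a
60% decrease, which falls in the "preferred" tier.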

The methods comprise combining a candidate bioactive agent and a cell or a population of cells
comprising a fusion nucleic acid. In a preferred embodiment, the fusion nucleic acid comprises an
IL-4 inducible ε promoter and at least a p- or rGFP gene. The IL-4 inducible ε promoter is as
described herein, or derivatives thereof, and may be either an endogenous or exogenous IL-4
inducible ε promoter, as is more fully described below.

In a preferred embodiment, constructs comprising a promoter and two reporter genes can be made.
In this embodiment, the first reporter gene is a p- or rGFP gene. The second reporter gene is a death
gene that provides a nucleic acid that encodes a protein that causes the cells to die. Death genes fall
into two basic categories: death genes that encode death proteins that require a death ligand to kill the
cells, and death genes that encode death proteins that kill cells as a result of high expression within
the cell, and do not require the addition of any death ligand. It is preferable that cell death requires a
two-step process: the expression of the death gene and induction of the death phenotype with a signal
or ligand, such that the cells may be grown up expressing the death gene, and then induced to die. A
number of death gene/ligand pairs are known, including, but not limited to, the Fas receptor and Fas
ligand (Bodmer, et al., "Characterization of Fas," J Biol Chem 272(30):18827-18833 (Jul 25, 1997);
muFAS, Gonzalez-Cuadrado, et al., "Agonistic anti-Fas Antibodies Induce Glomerular Cell Apoptosis
in Mice In Vivo," Kidney Int 51(6):1739-1746 (Jun 1997); Muruve, et al., Hum Gene Ther 8(8):955
(May 1997)) (or anti-Fas receptor antibodies); p450 and cyclophosphamide (Chen, et al., "Potentiation
of Cytochrome P450/Cyclophosphamide-Based Cancer Gene Therapy By Coexpression of the P450
Reductase Gene," Cancer Res 57(21):4830-4837 (Nov 1 1997)); thymidine kinase and ganciclovir
(Stone, R., "Molecular 'Surgery' For Brain Tumors," 256(5063):1513 (June 12, 1992)); and tumor necrosis
factor (TNF) receptor and TNF. Alternatively, the death gene need not require a ligand, and death
results from high expression of the gene; for example, the overexpression of a number of
programmed cell death (PCD) proteins is known to cause cell death, including, but not limited to,
caspases, bax, TRADD, FADD, SCK, MEK, etc.

In addition to the IL-4 inducible ε promoter, other promoters of interest can be used. The promoter of
interest can be either a constitutive promoter or an inducible promoter, such as the IL-4 inducible ε
promoter. As will be appreciated by those in the art, any number of possible promoters could be used.
Suitable promoters of interest include, but are not limited to, inducible promoters such as the IL-4 ε
promoter, promoters that are induced by cytokines or growth factors such as the interferon responsive
factors 1 to 4, NFkB (Fiering, et al., "Single Cell Assay of a Transcription Factor Reveals a Threshold
in Transcription Activated By Signals Emanating From the T-Cell Antigen Receptor," Genes Dev
4(10):1823-1834 (Oct 1990)), promoters activated by heavy metals, heat shock promoters, stress
promoters, etc. When inducible promoters are used in this embodiment, suitable cell types are those
that can be induced by the appropriate inducer, as will be appreciated by those in the art. Constitutive
promoters are also of use, particularly tissue specific promoters, including, but not limited to, CNS,
PNS, brain, kidney, skin, bone, lung, heart, liver, bladder, ovary, testes, colon, etc. specific promoters.

In a preferred embodiment, the promoter of interest is a constitutive promoter, and it is linked to a
death gene that requires the presence of a ligand, such as Fas or TNF. Thus, the cells can be grown
up and the presence of the death gene verified due to the constitutive promoter. This is done by
linking the death gene to a p- or rGFP gene, using either an IRES or a protease cleavage site as
is outlined below; thus, the presence of the p- or rGFP gene means the death gene is also present.
Verification of the presence of the death gene is preferred to keep the levels of false positives low; that
is, cells that survive the screen should do so due to the presence of an inhibitor of the promoter rather
than a lack of the death gene.

Once the cells have been enriched for those containing the death gene, the candidate agents can be
added (and their presence verified as well), followed by induction in the presence of IL-4, and finally by
addition of the death ligand. Thus, the cell population is enriched for those cells that have an agent
that inhibits the promoter and thus do not produce the death protein, i.e. those that survive.
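
The selection logic of this two-step screen can be sketched as a toy model. The Python below is only
an illustration under simplifying assumptions that are not in the text: each cell is reduced to two
booleans, induction and ligand addition are treated as fully efficient, and the names `Cell`,
`survives` and `run_screen` are hypothetical. It shows why pre-sorting for the p- or rGFP signal
matters: without it, cells that simply lack the death gene also survive and appear as false positives.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    has_construct: bool  # carries the GFP/death-gene fusion (GFP-positive)
    has_inhibitor: bool  # carries a candidate agent that blocks the promoter


def survives(cell):
    """A cell dies only if the induced promoter expresses the death receptor."""
    expresses_death_receptor = cell.has_construct and not cell.has_inhibitor
    return not expresses_death_receptor


def run_screen(cells, presort_for_gfp=True):
    """Return the cells remaining after induction and death-ligand addition."""
    if presort_for_gfp:
        # Keep only cells verified to carry the construct (GFP-positive sort).
        cells = [c for c in cells if c.has_construct]
    return [c for c in cells if survives(c)]
```

With one cell of each of the four possible types, the pre-sorted screen returns only the cell that
carries both the construct and an inhibitor, whereas the unsorted screen also returns the two cells
that lack the construct.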

When death genes that require ligands are used, i.e. for "two step" processes, preferred embodiments
utilize chimeric death genes, i.e. chimeric death receptor genes. These chimeric death receptors
comprise the extracellular domain of a ligand-activated multimerizing receptor and the endogenous
cytosolic domain of a death receptor gene, such as Fas or TNF. This is done to avoid endogenous
activation of the death gene. The mechanism of Fas-induced cell death involves the introduction of
the Fas ligand, which can bind two monomeric Fas receptors, causing the multimerization of the
receptor, which activates the receptor and leads to secondary signalling resulting in caspase activation
and PCD. However, as will be appreciated by those in the art, it is possible to substitute the
extracellular portion of the death receptor with the extracellular portion of another ligand-activated
multimerizing receptor, such that a completely different signal activates the cell to die. There are a
number of known ligand-activated dimerizing receptors, including, but not limited to, the CD8 receptor,
erythropoietin receptor, thrombopoietin receptor, growth hormone receptor, Fas receptor,
platelet-derived growth factor receptor, epidermal growth factor receptor, leptin receptor, and a variety of
interleukin receptors (including, but not limited to, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11,
IL-12, IL-13, IL-15, and IL-17; although the use of the IL-4 and IL-13 receptors is not preferred, since
these can be used to induce the promoter and thus do not provide a "two step" death process),
low-density lipoprotein receptor, prolactin receptor, and transferrin receptor.

In a preferred embodiment, chimeric Fas receptor genes are made. The exact combination will
depend on the cell type used and the receptors normally produced by these cells. For example, when
using human cells or cell lines, a non-human extracellular domain and a human cytosolic domain are
preferred, to prevent endogenous induction of the death gene. For example, a preferred
embodiment utilizes human cells, a murine extracellular Fas receptor domain and a human cytosolic
domain, such that the endogenous human Fas ligand will not activate the murine domain.
Alternatively, human extracellular domains may be used when the cells used do not endogenously
produce the ligand; for example, the human EPO extracellular domain may be used when the cells do
not endogenously produce EPO. (Kawaguchi, et al., Cancer Lett., 116(1):53 (1997); Takebayashi, et
al., Cancer Res., 56(18):4164 (1996); Rudert, et al., Biochem Biophys Res Commun., 204(3):1102
(1994); Rudert, et al., DNA Cell Biol., 16(2):197 (1997); Takahasi, et al., J Biol Chem., 271(29):17555
(1996); Adam, et al., J Biol Chem., 268(26):19882 (1993); Mares, et al., Growth Factors, 6(2):93
(1992); Seedorf, et al., J Biol Chem., 266(19):12424 (1991); Heidaran, et al., J Biol Chem.,
265(31):18741 (1990); Okuda, et al., J Clin Invest., 100(7):1708 (1997); Allgood, et al., Curr Opin
Biotechnol., 8(4):474 (1997); Anders, et al., J Biol Chem., 271(36):21758 (1996); Krishnan, et al.,
Oncogene, 13(1):125 (1996); Declercq, et al., Cytokine, 7(7):701 (1995); Bazzoni, et al., Proc Natl
Acad Sci USA, 92(12):5380 (1995); Ohashi, et al., Proc Natl Acad Sci USA, 91(1):168 (1994);
Desai, et al., Cell, 73(3):541 (1993); and Amara, et al., Proc Natl Acad Sci USA, 94(20):10618
(1997)).

In addition to the extracellular domain and the cytosolic domain, these receptors have a
transmembrane domain. As will be appreciated by those in the art, for chimeric death receptor genes,
the transmembrane domain from any of the receptors can be used, although in general, it is preferred
to use the transmembrane domain associated with the chosen cytosolic domain, to preserve the
interaction of the transmembrane domain with other, endogenous signalling proteins.

Thus, preferred embodiments provide fusion nucleic acids that utilize the IL-4 inducible ε promoter
linked to a p- or rGFP gene and a death gene, particularly a chimeric death receptor gene, that
requires a death ligand for cell killing, particularly with an IRES in between the reporter genes.

Alternatively, inducible promoters can be linked to "one step" death genes, i.e. death genes that, upon
a certain threshold expression, will kill a cell without requiring a ligand or secondary signal. In this
embodiment, the inducible promoter is preferably "leaky", such that some small amount of death gene
and a required secondary reporter gene such as a survival gene or a detection gene can be
expressed. The cells that contain the death gene can then be selected on this basis, to avoid false
positives. Once the presence of the construct is verified, candidate agents are added (and their
presence preferably verified, using a detection or selection gene as well), and the promoter is induced.
The population is then enriched for those cells that contain agents that inhibit the promoter, i.e. that
will survive. In this embodiment, a p- or rGFP gene is used, particularly when inducible death genes
are used. The use of a p- or rGFP gene allows cells to be sorted to give a population enriched for
those containing the construct. As outlined above, a preferred embodiment uses "leaky" inducible
promoters; that is, the cells are selected such that the IL-4 inducible promoter, even in the absence of
IL-4 or IL-13, produces some p- or rGFP and death gene (for example, the Fas receptor constructs).
In this embodiment, suitably "leaky" promoters are chosen such that some p- or rGFP is expressed
(preferably enough to select the cells expressing the construct from those that are not), but not
enough death gene is produced to cause death. While preferred embodiments utilize death genes
requiring the addition of a death ligand, it is well known that high levels of some death genes, even in
the absence of death ligand, can cause death. Thus, for example, high levels of Fas receptor
expression can cause multimerization, and thus activation, even in the absence of the Fas ligand.

In a preferred embodiment, when two reporter genes are used, they are fused together in such a way
as to only require a single promoter, and thus some way of functionally separating the two genes is
preferred. This can be done on the RNA level or the protein level. Preferred embodiments utilize
either IRES sites (which allow the translation of two different genes on a single transcript (Kim, et al.,
"Construction of a Bifunctional mRNA in the Mouse By Using the Internal Ribosomal Entry Site of the
Encephalomyocarditis Virus," Molecular and Cellular Biology 12(8):3636-3643 (Aug 1992) and
McBratney, et al., "The Sequence Context of the Initiation Codon in the Encephalomyocarditis Virus
Leader Modulates Efficiency of Internal Translation Initiation," Current Opinion in Cell Biology 5:961-
965 (1993))), or a protease cleavage site (which cleaves a protein translation product into two
proteins). Preferred protease cleavage sites include, but are not limited to, the 2a site (Ryan et al., J.
Gen. Virol. 72:2727 (1991); Ryan et al., EMBO J. 13:928 (1994); Donnelly et al., J. Gen. Virol. 78:13
(1997); Hellen et al., Biochem. 28(26):9881 (1989); and Mattion et al., J. Virol. 70:8124 (1996), all of
which are expressly incorporated by reference), prosequences of retroviral proteases including human
immunodeficiency virus protease and sequences recognized and cleaved by trypsin (EP 578472,
Takasuga et al., J. Biochem. 112(5):652 (1992)), factor Xa (Gardella et al., J. Biol. Chem.
265(26):15854 (1990), WO 9006370), collagenase (J03280893, Tajima et al., J. Ferment. Bioeng.
72(5):362 (1991), WO 9006370), clostripain (EP 578472), subtilisin (including mutant H64A subtilisin,
Forsberg et al., J. Protein Chem. 10(5):517 (1991)), chymosin, yeast KEX2 protease (Bourbonnais et
al., J. Bio. Chem. 263(30):15342 (1988)), thrombin (Forsberg et al., supra; Abath et al., BioTechniques
10(2):178 (1991)), Staphylococcus aureus V8 protease or similar endoproteinase-Glu-C to cleave
after Glu residues (EP 578472, Ishizaki et al., Appl. Microbiol. Biotechnol. 36(4):483 (1992)), cleavage
by NIa proteinase of tobacco etch virus (Parks et al., Anal. Biochem. 216(2):413 (1994)),
endoproteinase-Lys-C (U.S. Patent No. 4,414,332) and endoproteinase-Asp-N, Neisseria type 2 IgA
protease (Pohlner et al., Bio/Technology 10(7):799-804 (1992)), soluble yeast endoproteinase yscF
(EP 467839), chymotrypsin (Altman et al., Protein Eng. 4(5):593 (1991)), enteropeptidase (WO
9006370), lysostaphin, a polyglycine specific endoproteinase (EP 316748), and the like. See e.g.
Marston, F.A.O. (1986) Biochem. J. 240, 1-12.

Thus, in a preferred embodiment, fusion constructs comprising a gene of interest, an IRES site and a
p- or rGFP gene are provided.

In addition to the promoter of interest, such as an IL-4 inducible ε promoter, and the p- or rGFP gene, the
fusion nucleic acids may comprise additional components, including, but not limited to, other reporter
genes, protein cleavage sites, internal ribosome entry (IRES) sites, AP-1 sites, and other components
as will be appreciated by those in the art.

In a preferred embodiment, foreign constructs comprising the IL-4 inducible ε promoter and the p- or
rGFP gene are made. By "foreign" herein is meant that the fusion nucleic acid originates outside of
the cells. That is, a recombinant nucleic acid is made that contains an exogenous IL-4 inducible ε
promoter and a p- or rGFP gene. Thus, in some circumstances, the cells will contain both
exogenous and endogenous IL-4 inducible ε promoters. By "recombinant nucleic acid" herein is
meant nucleic acid, originally formed in vitro, in general, by the manipulation of nucleic acid by
endonucleases, in a form not normally found in nature. Thus an isolated nucleic acid, in a linear form,
a nucleic acid containing components not normally joined, such as a non-p- or rGFP promoter and
a p- or rGFP gene, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are all considered recombinant for the purposes of this invention. It is understood that
once a recombinant nucleic acid is made and reintroduced into a host cell or organism, it will replicate
non-recombinantly, i.e. using the in vivo cellular machinery of the host cell rather than in vitro
manipulations; however, such nucleic acids, once produced recombinantly, although subsequently
replicated non-recombinantly, are still considered recombinant for the purposes of the invention.

For the IL-4 inducible ε promoter systems, any cells that express an IL-4 receptor that transduces the
IL-4 signal to the nucleus and alters transcription can be used. Suitable cells include, but are not
limited to, human cells and cell lines that show IL-4/13 inducible production of germline ε transcripts,
including, but not limited to, DND39 (see Watanabe, supra), MC-116 (Kumar, et al., "Human BCGF-
12kD Functions as an Autocrine Growth Factor in Transformed B Cells," Eur Cytokine Netw 1(2):109
(1990)), and CA-46 (Wang, et al., "UCN-01: A Potent Abrogator of G2 Checkpoint Function in Cancer
Cells with Disrupted p53," J Natl Cancer Inst 88:956 (1996)).

As for all the embodiments outlined herein, the recombinant nucleic acid (e.g. the fusion nucleic acids)
may be introduced to a cell in a variety of ways, as will be appreciated by those in the art, including,
but not limited to, CaPO4 precipitation, liposome fusion, lipofectin®, electroporation, viral infection, etc.
The constructs may preferably stably integrate into the genome of the host cell (for example, with
retroviral introduction, outlined below), or may exist either transiently or stably in the cytoplasm (i.e.
through the use of traditional plasmids, utilizing standard regulatory sequences, selection markers,
etc.).

In a preferred embodiment, the exogenous constructs, which may be in the form of an expression
vector, are added as retroviral constructs, using techniques generally described in PCT US97/01019
and PCT US97/01048, both of which are expressly incorporated by reference in their entirety.

In a preferred embodiment, the fusion construct comprises an endogenous promoter (such as an
IL-4 inducible ε promoter) and an exogenous p- or rGFP gene; "endogenous" in this context means
originating within the cell. That is, gene "knock-in" constructions are made, whereby an exogenous
p- or rGFP gene as outlined herein is added, via homologous recombination, to the genome, such that
the reporter gene is under the control of the endogenous promoter. This may be desirable to allow
for the exploration and modulation of the full range of endogenous regulation, i.e. regulatory
elements (particularly those flanking the promoter) other than just the promoter fragment.

Homologous recombination may proceed in several ways. In one embodiment, traditional homologous
recombination is done, with molecular biological techniques such as PCR being done to find the
correct insertions. For example, gene "knock-ins" may be done as is known in the art, for example
see Westphal et al., Current Biology 7:R530-R533 (1997), and references cited therein, all of which
are expressly incorporated by reference. The use of recA mediated systems may also be done, see
PCT US93/03868, hereby expressly incorporated by reference.

Alternatively, and preferably, the selection of the "knock ins" is done by FACS on the basis of the
incorporation of the p- or rGFP gene. Thus, in a preferred embodiment, a first homologous
recombination event is done to put a p- or rGFP gene into at least one allele of the cell genome.
When the promoter is the IL-4 inducible promoter, preferably, this is a cell type that exhibits IL-4
inducible production of at least germline ε transcripts, so that the cells may be tested by IL-4
induction for reporter gene expression. Suitable cells include, but are not limited to, human cells and
cell lines that show IL-4/13 inducible production of germline ε transcripts, including, but not limited to,
DND39 (see Watanabe, supra), MC-116 (Kumar, et al., "Human BCGF-12kD Functions as an
Autocrine Growth Factor in Transformed B Cells," Eur Cytokine Netw 1(2):109 (1990)), and CA-46 (Wang,
et al., "UCN-01: A Potent Abrogator of G2 Checkpoint Function in Cancer Cells with Disrupted p53," J
Natl Cancer Inst 88:956 (1996)). As is noted herein, the ability of MC-116 and CA-46 cells to produce
germline ε transcripts upon IL-4/13 induction was not known prior to the present invention. Thus,
preferred embodiments provide MC-116 and/or CA-46 cells comprising recombinant nucleic acid
reporter constructs as outlined herein.

As will be appreciated by those in the art and outlined herein, any number of suitable cell types can be 
used in the present invention. 

In a preferred embodiment, once a first endogenous promoter has been combined with an
exogenous reporter construct, a second homologous recombination event may be done, preferably
using a second reporter gene different from the first, to target the other allele of the cell genome, and
tested as above.

Generally, IL-4 induction of the p- or rGFP genes will indicate the correct placement of the genes,
which can be confirmed via sequencing such as PCR sequencing or Southern blot hybridization. In
addition, preferred embodiments utilize prescreening steps to remove "leaky" cells, i.e. those showing
constitutive expression of the p- or rGFP gene.

Thus, in a preferred embodiment, the invention provides cell lines that contain fusion nucleic acids
comprising an IL-4 inducible ε promoter operably connected to a p- or rGFP gene. Once made, the cell
lines comprising these reporter constructs are used to screen candidate bioactive agents for the ability
to modulate the production of IgE, as is outlined below.

The term "candidate bioactive agent" or "exogenous compound" as used herein describes any
molecule, e.g., protein, oligopeptide, small organic molecule, polysaccharide, polynucleotide.
Generally a plurality of assay mixtures are run in parallel with different agent concentrations to obtain a
differential response to the various concentrations. Typically, one of these concentrations serves as a
negative control, i.e., at zero concentration or below the level of detection.
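
One simple way to read such a parallel concentration series is to express each reading relative to the
zero-concentration negative control. The Python sketch below is illustrative only; the function name
`normalize_to_control`, the dictionary layout, and the use of a plain ratio as the "differential
response" are assumptions, not something specified in the text.

```python
def normalize_to_control(readings):
    """Express each reading as a fraction of the zero-concentration control.

    `readings` maps agent concentration to a reporter signal (hypothetical
    units) and must contain an entry for concentration 0.0, which serves as
    the negative control.
    """
    if 0.0 not in readings:
        raise ValueError("a zero-concentration negative control is required")
    control = readings[0.0]
    if control <= 0:
        raise ValueError("control signal must be positive")
    return {
        conc: signal / control
        for conc, signal in sorted(readings.items())
        if conc != 0.0
    }
```

A relative response below 1.0 at a given concentration then indicates a decrease in reporter signal
compared with the untreated control.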

Candidate agents encompass numerous chemical classes, though typically they are organic
molecules, preferably small organic compounds having a molecular weight of more than 100 and less
than about 2,500 daltons. Candidate agents comprise functional groups necessary for structural
interaction with proteins, particularly hydrogen bonding, and typically include at least an amine,
carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The
candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or
polyaromatic structures substituted with one or more of the above functional groups. Candidate
agents are also found among biomolecules including peptides, saccharides, fatty acids, steroids,
purines, pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred are
peptides.

Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural 
compounds. For example, numerous means are available for random and directed synthesis of a 
wide variety of organic compounds and biomolecules, including expression of randomized 
oligonucleotides. Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant 
and animal extracts are available or readily produced. Additionally, natural or synthetically produced 
libraries and compounds are readily modified through conventional chemical, physical and 
biochemical means. Known pharmacological agents may be subjected to directed or random
chemical modifications, such as acylation, alkylation, esterification, and amidification, to produce
structural analogs.

In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is meant
at least two covalently attached amino acids, which includes proteins, polypeptides, oligopeptides and
peptides. The protein may be made up of naturally occurring amino acids and peptide bonds, or
synthetic peptidomimetic structures. Thus "amino acid", or "peptide residue", as used herein means
both naturally occurring and synthetic amino acids. For example, homo-phenylalanine, citrulline and
norleucine are considered amino acids for the purposes of the invention. "Amino acid" also includes
imino acid residues such as proline and hydroxyproline. The side chains may be in either the (R) or
the (S) configuration. In the preferred embodiment, the amino acids are in the (S) or L-configuration.
If non-naturally occurring side chains are used, non-amino acid substituents may be used, for example
to prevent or retard in vivo degradations.

In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or
fragments of naturally occurring proteins. Thus, for example, cellular extracts containing proteins, or
random or directed digests of proteinaceous cellular extracts, may be used. In this way libraries of
procaryotic and eucaryotic proteins may be made for screening in the systems described herein.
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian
proteins, with the latter being preferred, and human proteins being especially preferred.

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to about 30
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to about 15
being particularly preferred. The peptides may be digests of naturally occurring proteins as is outlined
above, random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents
herein is meant that each nucleic acid and peptide consists of essentially random nucleotides and
amino acids, respectively. Since generally these random peptides (or nucleic acids, discussed below)
are chemically synthesized, they may incorporate any nucleotide or amino acid at any position. The
synthetic process can be designed to generate randomized proteins or nucleic acids, to allow the
formation of all or most of the possible combinations over the length of the sequence, thus forming a
library of randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or constants at any
position. In a preferred embodiment, the library is biased. That is, some positions within the
sequence are either held constant, or are selected from a limited number of possibilities. For
example, in a preferred embodiment, the nucleotides or amino acid residues are randomized within a
defined class, for example, of hydrophobic amino acids, hydrophilic residues, sterically biased (either
small or large) residues, towards the creation of cysteines for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc.
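
The difference between a fully randomized and a biased library can be sketched in silico. The Python
below is a minimal illustration, not a description of how the libraries are actually synthesized: the
function names, the per-position "allowed residues" representation, and the particular residues
grouped as hydrophobic are assumptions chosen for the example.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
HYDROPHOBIC = "AVLIMFW"               # illustrative grouping (an assumption)


def random_peptide(length, rng, alphabet=AMINO_ACIDS):
    """Fully randomized peptide: any residue may occur at any position."""
    return "".join(rng.choice(alphabet) for _ in range(length))


def biased_peptide(position_choices, rng):
    """Biased peptide: each position draws from its own allowed residues.

    A one-letter entry holds that position constant; a longer entry restricts
    the position to a defined class of residues.
    """
    return "".join(rng.choice(allowed) for allowed in position_choices)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded only to make the demo repeatable
    print(random_peptide(10, rng))
    # Cysteines fixed at both ends (e.g. for cross-linking), one position
    # restricted to the hydrophobic class, the rest fully random.
    template = ["C", AMINO_ACIDS, HYDROPHOBIC, AMINO_ACIDS, AMINO_ACIDS, "C"]
    print(biased_peptide(template, rng))
```

Holding a position constant is just the limiting case of restricting it to a class of one residue, which
is why a single per-position list covers both kinds of bias described above.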

In a preferred embodiment, the candidate bioactive agents are nucleic acids. By "nucleic acid" or
"oligonucleotide" or grammatical equivalents herein is meant at least two nucleotides covalently linked
together. A nucleic acid of the present invention will generally contain phosphodiester bonds, although
in some cases, as outlined below, nucleic acid analogs are included that may have alternate
backbones, comprising, for example, phosphoramide (Beaucage, et al., Tetrahedron, 49(10):1925
(1993) and references therein; Letsinger, J. Org. Chem., 35:3800 (1970); Sprinzl, et al., Eur. J.
Biochem., 81:579 (1977); Letsinger, et al., Nucl. Acids Res., 14:3487 (1986); Sawai, et al., Chem.
Lett., 805 (1984); Letsinger, et al., J. Am. Chem. Soc., 110:4470 (1988); and Pauwels, et al., Chemica
Scripta, 26:141 (1986)), phosphorothioate (Mag, et al., Nucleic Acids Res., 19:1437 (1991); and U.S.
Patent No. 5,644,048), phosphorodithioate (Briu, et al., J. Am. Chem. Soc., 111:2321 (1989)),
O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical
Approach, Oxford University Press), and peptide nucleic acid backbones and linkages (see Egholm,
J. Am. Chem. Soc., 114:1895 (1992); Meier, et al., Chem. Int. Ed. Engl., 31:1008 (1992); Nielsen,
Nature, 365:566 (1993); Carlsson, et al., Nature, 380:207 (1996), all of which are incorporated by
reference). Other analog nucleic acids include those with positive backbones (Denpcy, et al., Proc.
Natl. Acad. Sci. USA, 92:6097 (1995)); non-ionic backbones (U.S. Patent Nos. 5,386,023; 5,637,684;
5,602,240; 5,216,141; and 4,469,863; Kiedrowshi, et al., Angew. Chem. Intl. Ed. English, 30:423
(1991); Letsinger, et al., J. Am. Chem. Soc., 110:4470 (1988); Letsinger, et al., Nucleoside &
Nucleotide, 13:1597 (1994); Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker, et al.,
Bioorganic & Medicinal Chem. Lett., 4:395 (1994); Jeffs, et al., J. Biomolecular NMR, 34:17 (1994);
Tetrahedron Lett., 37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent
Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing
one or more carbocyclic sugars are also included within the definition of nucleic acids (see Jenkins, et
al., Chem. Soc. Rev., (1995) pp. 169-176). Several nucleic acid analogs are described in Rawls, C &
E News, June 2, 1997, page 35. All of these references are hereby expressly incorporated by
reference. These modifications of the ribose-phosphate backbone may be done to facilitate the
addition of additional moieties such as labels, or to increase the stability and half-life of such
molecules in physiological environments. In addition, mixtures of naturally occurring nucleic acids and
analogs can be made. Alternatively, mixtures of different nucleic acid analogs, and mixtures of
naturally occurring nucleic acids and analogs may be made. The nucleic acids may be single stranded
or double stranded, as specified, or contain portions of both double stranded and single stranded
sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, where the
nucleic acid contains any combination of deoxyribo- and ribo-nucleotides, and any combination of
bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine,
isocytosine, isoguanine, etc.

As described above generally for proteins, nucleic acid candidate bioactive agents may be naturally
occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, digests
of procaryotic or eucaryotic genomes may be used as is outlined above for proteins.

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a wide
variety of which are available in the literature.

In a preferred embodiment, a library of different candidate bioactive agents is used. Preferably, the
library should provide a sufficiently structurally diverse population of randomized agents to effect a
probabilistically sufficient range of diversity to allow binding to a particular target. Accordingly, an
interaction library should be large enough so that at least one of its members will have a structure that
gives it affinity for the target. Although it is difficult to gauge the required absolute size of an
interaction library, nature provides a hint with the immune response: a diversity of 10^7-10^8 different
antibodies provides at least one combination with sufficient affinity to interact with most potential
antigens faced by an organism. Published in vitro selection techniques have also shown that a library
size of 10^7 to 10^8 is sufficient to find structures with affinity for the target. A library of all
combinations of a peptide 7 to 20 amino acids in length, such as generally proposed herein, has the
potential to code for 20^7 (10^9) to 20^20. Thus, with libraries of 10^7 to 10^8 different molecules the
present methods allow a "working" subset of a theoretically complete interaction library for 7 amino
acids, and a subset of shapes for the 20^20 library. Thus, in a preferred embodiment, at least 10^6,
preferably at least 10^7, more preferably at least 10^8 and most preferably at least 10^9 different
sequences are simultaneously analyzed in the subject methods. Preferred methods maximize library size
and diversity.
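
The relationship between peptide length and theoretical library size described above can be checked
with a short calculation. The following is only an illustrative sketch: it assumes the 20 standard
amino acids at every position, and the function names and the example library size of 10^8 members
are assumptions introduced here, not part of the specification.

```python
# Illustrative sketch: theoretical sequence space of a fully randomized peptide.
# Assumes 20 possible amino acids at each position (an assumption of this sketch).
AMINO_ACIDS = 20


def sequence_space(length: int) -> int:
    """Number of distinct peptides of the given length."""
    return AMINO_ACIDS ** length


def coverage(library_size: int, length: int) -> float:
    """Upper bound on the fraction of sequence space a library can sample,
    assuming every library member is a distinct sequence."""
    return min(1.0, library_size / sequence_space(length))


if __name__ == "__main__":
    for n in (7, 20):
        print(f"length {n}: {sequence_space(n):.2e} possible sequences")
    # Hypothetical library of 10**8 members screened against 7-mers.
    print(f"coverage of 7-mer space: {coverage(10**8, 7):.4f}")
```

Under these assumptions a library of this size samples a meaningful fraction of the 7-mer space but
only a vanishing fraction of the 20-mer space, which is the sense in which such a library is a
"working" subset.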

The candidate bioactive agents are combined or added to a cell or population of cells. Suitable cell 
types for different embodiments are outlined above. By "population of cells" herein is meant at least 
two cells, with at least about 10» being preferred, at least about 10« being particularly preferred, and at
least about 10'. 10* and 10* being especially preferred.

The candidate bioactive agent and the cells are combined. As will be appreciated by those in the art,
this may be accomplished in any number of ways, including adding the candidate agents to the surface of
the cells, to the media containing the cells, or to a surface on which the cells are growing or in contact
with; adding the agents into the cells, for example by using vectors that will introduce the agents into
the cells (i.e. when the agents are nucleic acids or proteins).

In a preferred embodiment, the candidate bioactive agents are either nucleic acids or proteins
(proteins in this context includes proteins, oligopeptides, and peptides) that are introduced into the
host cells using retroviral vectors, as is generally outlined in PCT US97/01019 and PCT US97/01048,
both of which are expressly incorporated by reference. Generally, a library of retroviral vectors is made
using retroviral packaging cell lines that are helper-defective and are capable of producing all the
necessary trans proteins, including gag, pol and env, and RNA molecules that have in cis the ψ
packaging signal. Briefly, the library is generated in a retrovirus DNA construct backbone; standard
oligonucleotide synthesis is done to generate either the candidate agent or nucleic acid encoding a
protein, for example a random peptide, using techniques well known in the art. After generation of the
DNA library, the library is cloned into a first primer. The first primer serves as a "cassette", which is
inserted into the retroviral construct. The first primer generally contains a number of elements,
including for example, the required regulatory sequences (e.g. translation, transcription, promoters,
etc.), fusion partners, restriction endonuclease (cloning and subcloning) sites, stop codons (preferably
in all three frames), regions of complementarity for second strand priming (preferably at the end of the
stop codon region as minor deletions or insertions may occur in the random region), etc.
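
The preference for stop codons in all three frames can be illustrated with a small check. This is a
minimal sketch under stated assumptions: the stop region shown is a hypothetical sequence chosen for
illustration and is not taken from the specification, and only the three forward reading frames of
the given strand are examined.

```python
# Illustrative sketch: verify that a candidate stop region terminates
# translation in all three forward reading frames.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def frames_with_stop(seq: str) -> set[int]:
    """Return the forward reading frames (0, 1, 2) that contain a stop codon."""
    seq = seq.upper()
    frames = set()
    for frame in range(3):
        # Walk the sequence codon by codon, starting at the frame offset.
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                frames.add(frame)
                break
    return frames


# Hypothetical stop region (an assumption for illustration only).
STOP_REGION = "TGACTGACTGA"

if __name__ == "__main__":
    print(sorted(frames_with_stop(STOP_REGION)))
```

A region that passes this check terminates translation even if a minor insertion or deletion in the
random region shifts the reading frame.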

A second primer is then added, which generally consists of some or all of the complementarity region
to prime the first primer and optional necessary sequences for a second unique restriction site for
subcloning. DNA polymerase is added to make double-stranded oligonucleotides. The double-stranded
oligonucleotides are cleaved with the appropriate subcloning restriction endonucleases and
subcloned into the target retroviral vectors, described below.

Any number of suitable retroviral vectors may be used. Generally, the retroviral vectors may include:
selectable marker genes under the control of internal ribosome entry sites (IRES), which greatly
facilitate the selection of cells expressing peptides at uniformly high levels; and promoters driving
expression of a second gene, placed in sense or anti-sense relative to the 5' LTR. Suitable selection
genes include, but are not limited to, neomycin, blasticidin, bleomycin, puromycin, and hygromycin
resistance genes, as well as self-fluorescent markers such as green fluorescent protein including rr,
enzymatic markers such as lacZ, and surface proteins such as CD8, etc.

Preferred vectors include a vector based on the murine stem cell virus (MSCV) (see Hawley et al.,
Gene Therapy 1:136 (1994)) and a modified MFG virus (Rivere et al., Genetics 92:6733 (1995)), and
pBABE, outlined in the examples.

The retroviruses may include inducible and constitutive promoters for the expression of the candidate
agent (to be distinguished from the IL-4 inducible ε promoter). For example, there are situations
wherein it is necessary to induce peptide expression only during certain phases of the selection
process. A large number of both inducible and constitutive promoters are known.

In addition, it is possible to configure a retroviral vector to allow inducible expression of retroviral
inserts after integration of a single vector in target cells; importantly, the entire system is contained
within the single retrovirus. Tet-inducible retroviruses have been designed incorporating the
Self-Inactivating (SIN) feature of 3' LTR enhancer/promoter retroviral deletion mutant (Hoffman et al.,
PNAS USA 93:5185 (1996)). Expression of this vector in cells is virtually undetectable in the presence
of tetracycline or other active analogs. However, in the absence of Tet, expression is turned on to
maximum within 48 hours after induction, with uniform increased expression of the whole population of
cells that harbor the inducible retrovirus, indicating that expression is regulated uniformly within the
infected cell population. A similar, related system uses a mutated Tet DNA-binding domain such that
it bound DNA in the presence of Tet, and was removed in the absence of Tet. Either of these systems
is suitable.

In a preferred embodiment, the candidate bioactive agents are linked to a fusion partner as defined
above.

In a preferred embodiment, the invention provides compositions and methods utilizing p- or rGFP as a
reporter molecule for use in cell assays. As will be appreciated by those in the art, any assay for
which a reporter gene can be used can be run using p- or rGFP.

In a preferred embodiment, the present invention provides compositions and methods utilizing p- or
rGFP (and/or pGFP) and a chip device comprising integrated photodetectors at individual loci. The
method may be practiced with any suitable chip device that includes an electronic circuit capable of
reading the sensed signal generated by each photodetector and generating output data signals
therefrom. The output data signals are indicative of the light emitted, due to the presence of p- or
rGFP, at the various loci. As will be appreciated by those in the art, any assay that evaluates binding
interactions can utilize the present invention.
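
How output data signals from such a device might be interpreted can be sketched in a few lines. The
following is purely hypothetical: the locus names, the background value, and the signal-to-background
ratio used as a cutoff are assumptions introduced for illustration and are not features of any
particular chip device.

```python
# Illustrative sketch: flag loci whose photodetector signal indicates emission
# above background. All names and numeric values are hypothetical.
def positive_loci(signals: dict[str, float], background: float,
                  min_ratio: float = 3.0) -> list[str]:
    """Return the loci whose signal is at least min_ratio times background."""
    if background <= 0:
        raise ValueError("background must be positive")
    return sorted(locus for locus, value in signals.items()
                  if value >= background * min_ratio)


if __name__ == "__main__":
    readings = {"A1": 120.0, "A2": 15.0, "B1": 40.0}  # hypothetical readings
    print(positive_loci(readings, background=10.0))
```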

Thus, the present invention finds use in a variety of assays, including but not limited to, assays for
protein-protein interactions, protein-nucleic acid interactions, and nucleic acid-nucleic acid
interactions.

In a preferred embodiment, any cellular assay that evaluates the effects of candidate agents,
preferably either nucleic acids or proteins (including peptides), can utilize the present invention. In this
embodiment, the candidate agents are fused to the p- or rGFP proteins of the present invention,
generally through making fusion nucleic acids and transforming into the cells to be assayed under
conditions that allow expression (if peptides are used) of the candidate agent. This allows a
confirmation that the candidate agent has been expressed, as well as tracking and localization of the
candidate agent, and the ability to sort cells comprising the candidate agents.

Thus, the present invention finds use in a variety of cellular assays, including but not limited to, assays
for alterations in exocytosis, cell cycle regulation, apoptosis, cellular proliferation and/or differentiation,
etc. The cells screened can also be a variety of cell types, including, but not limited to, any cells
outlined herein, including mast cells, T cells, B cells, macrophages, adipocytes, smooth muscle cells,
etc.

In addition, as outlined herein, the p- or rGFP proteins of the invention find particular use in screening 
assays that require a reporter protein, as outlined below. 

The present invention is directed to the detection of alterations in cellular phenotypes, such as cell
cycle regulation, exocytosis, small molecule toxicity, cell surface receptor expression, enzyme
expression, etc., by evaluating or assaying a variety of cellular parameters, generally through the use
of a fluorescence-activated cell sorter (FACS) machine. There are a number of parameters that can
be measured to allow detection of alterations in a variety of cellular phenotypes as is more fully
outlined below. By assaying a plurality of these parameters either sequentially or preferably
simultaneously, rapid and accurate screening may be done.

In a preferred embodiment, the methods outlined herein are used to screen for modulators of cellular
phenotypes. Cellular phenotypes that may be assayed include, but are not limited to, cellular
apoptosis, cell cycle regulation, exocytosis, toxicity to small molecules, the expression of any
number of moieties including receptors (particularly cell surface receptors), adhesion molecules,
cytokine secretion, protein-protein interactions, etc. As will be appreciated by those in the art, any
number of cellular assays that rely on p- or rGFP can be developed. Thus, in a preferred
embodiment, the invention provides methods of screening comprising providing cell lines comprising
nucleic acids encoding a p- or rGFP protein, adding candidate bioactive agents and detecting
changes in cellular phenotype. The nucleic acid may preferably be a fusion nucleic acid, encoding a
gene or regulatory element of interest operably linked to a p- or rGFP protein.

In a preferred embodiment, the methods are used to evaluate cell cycle regulation. In this
embodiment, preferred cellular parameters or assays are cell viability assays, assays to determine
whether cells are arrested at a particular cell cycle stage ("cell proliferation assays"), and assays to
determine at which cell stage the cells have arrested ("cell phase assays"). By assaying or measuring
one or more of these parameters, it is possible to detect not only alterations in cell cycle regulation, but
alterations of different steps of the cell cycle regulation pathway. This may be done to evaluate native
cells, for example to quantify the aggressiveness of a tumor cell type, or to evaluate the effect of
candidate drug agents that are being tested for their effect on cell cycle regulation. In this manner,
rapid, accurate screening of candidate agents may be performed to identify agents that modulate cell
cycle regulation.

Thus, the present methods are useful to elucidate bioactive agents that can cause a population of
cells to either move out of one growth phase and into another, or arrest in a growth phase. In some
embodiments, the cells are arrested in a particular growth phase, and it is desirable to either get them
out of that phase or into a new phase. Alternatively, it may be desirable to force a cell to arrest in a
phase, for example G1, rather than continue to move through the cell cycle. Similarly, it may be
desirable in some circumstances to accelerate a non-arrested but slowly moving population of cells
into either the next phase or just through the cell cycle, or to delay the onset of the next phase. For
example, it may be possible to alter the activities of certain enzymes, for example kinases,
phosphatases, proteases or ubiquitination enzymes, that contribute to initiating cell phase changes.

In a preferred embodiment, the methods outlined herein are done on cells that are not arrested in the
G1 phase; that is, they are rapidly or uncontrollably growing and replicating, such as tumor cells. In
this manner, candidate agents are evaluated to find agents that can alter the cell cycle regulation, i.e.
cause the cells to arrest at cell cycle checkpoints, such as in G1 (although arresting in other phases
such as S, G2 or M is also desirable). Alternatively, candidate agents are evaluated to find agents
that can cause proliferation of a population of cells, i.e. that allow cells that are generally arrested in
G1 to start proliferating again; for example, peripheral blood cells, terminally differentiated cells, stem
cells in culture, etc.

Accordingly, in a preferred embodiment, the invention provides methods for screening for alterations in
cell cycle regulation of a population of cells. "Alteration" and "modulation" (used herein
interchangeably), as used herein can include both increases and decreases in the parameter or
phenotype being measured. By "alteration" or "modulation" in the context of cell cycle regulation, is
generally meant one of two things. In a preferred embodiment, the alteration results in a change in the
cell cycle of a cell, i.e. a proliferating cell arrests in any one of the phases, or an arrested cell moves
out of its arrested phase and starts the cell cycle, as compared to another cell or in the same cell
under different conditions. Alternatively, the progress of a cell through any particular phase may be
altered; that is, there may be an acceleration or delay in the length of time it takes for the cells to move
through a particular growth phase. For example, the cell may normally undergo a G1 phase of
several hours; the addition of an agent may prolong the G1 phase.

The measurements can be determined wherein all of the conditions are the same for each
measurement, or under various conditions, with or without bioactive agents, or at different stages of
the cell cycle process. For example, a measurement of cell cycle regulation can be determined in a
cell population wherein a candidate bioactive agent is present and wherein the candidate bioactive
agent is absent. In another example, the measurements of cell cycle regulation are determined
wherein the condition or environment of the populations of cells differ from one another. For example,
the cells may be evaluated in the presence or absence of physiological signals, for example
hormones, antibodies, peptides, antigens, cytokines, growth factors, action potentials,
pharmacological agents (i.e. chemotherapeutics, etc.), or other cells (i.e. cell-cell contacts). In another
example, the measurements of cell cycle regulation are determined at different stages of the cell
cycle process. In yet another example, the measurements of cell cycle regulation are taken wherein
the conditions are the same, and the alterations are between one cell or cell population and another
cell or cell population.
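
The comparison between a measurement taken with a candidate agent and one taken without it can be
sketched as follows. This is a minimal illustration only: the function name and the 10% tolerance
used to decide whether a difference counts as an alteration are assumptions introduced here; the
specification does not prescribe a particular cutoff.

```python
# Illustrative sketch: label a change in a measured parameter relative to a
# control. The tolerance is a hypothetical cutoff, not taken from the source.
def classify_alteration(control: float, treated: float,
                        tolerance: float = 0.10) -> str:
    """Return 'increase', 'decrease', or 'no alteration' for treated vs. control."""
    if control <= 0:
        raise ValueError("control measurement must be positive")
    change = (treated - control) / control  # fractional change
    if change > tolerance:
        return "increase"
    if change < -tolerance:
        return "decrease"
    return "no alteration"


if __name__ == "__main__":
    print(classify_alteration(control=100.0, treated=150.0))
```

Both directions are reported because, as stated above, an alteration can be an increase or a
decrease in the parameter being measured.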

In a preferred embodiment, the candidate bioactive agents are peptides and are fused with p- or rGFP
proteins; fusion nucleic acids are made, transformed into the cells and expressed. The presence of a
signal from the p- or rGFP protein shows that the candidate agent is expressed. The cells can then be
screened as below, to detect agents that affect cell viability, etc.

By a "population of cells" or "library of cells" or "plurality of cells" herein is meant at least two cells, with
at least about 10^ being preferred, at least about 10« being particularly preferred, and at least about
10' to 10® being especially preferred. The population or sample can contain a mixture of different cell
types from either primary or secondary cultures although samples containing only a single cell type are
preferred; for example, the sample can be from a cell line, particularly tumor cell lines (particularly
when . as outlined below. The cells may be in any cell phase, either synchronously or not, including
M, G1, S, and G2. In a preferred embodiment, cells that are replicating or proliferating are used; this
may allow the use of retroviral vectors for the introduction of candidate bioactive agents. Alternatively,
non-replicating cells may be used, and other vectors (such as adenovirus and lentivirus vectors) can
be used. In addition, although not required, the cells are compatible with dyes and antibodies.

Preferred cell types for use in the invention will vary with the cellular phenotype to be modulated. 
Suitable cells include, but are not limited to, mammalian cells, including animal (rodents, including
mice, rats, hamsters and gerbils), primates, and human cells, particularly including tumor cells of all
types, including breast, skin, lung, cervix, colorectal, leukemia, brain, etc. As outlined below,
additional cell types may be used for screening for exocytosis. 

In a preferred embodiment, the cell cycle regulation methods comprise sorting the cells in a FACS
machine by assaying several different cell parameters, including, but not limited to, cell viability, cell
proliferation, and cell phase.

In a preferred embodiment, cell viability is assayed, to ensure that a lack of cellular change is due to
experimental conditions (i.e. the introduction of a candidate bioactive agent) not cell death. There are
a variety of suitable cell viability assays which can be used, including, but not limited to, light
scattering, viability dye staining, and exclusion dye staining.

In a preferred embodiment, a light scattering assay is used as the viability assay, as is well known in
the art. When viewed in the FACS, cells have particular characteristics as measured by their forward
and 90 degree (side) light scatter properties. These scatter properties represent the size, shape and
granule content of the cells. These properties account for two parameters to be measured as a
readout for the viability. Briefly, the DNA of dying or dead cells generally condenses, which alters the
90° scatter; similarly, membrane blebbing can alter the forward scatter. Alterations in the intensity of
light scattering, or the cell-refractive index indicate alterations in viability.

Thus, in general, for light scattering assays, a live cell population of a particular cell type is evaluated 
to determine its forward and side scattering properties. This sets a standard for scattering that can
subsequently be used. 
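
One way such a scattering standard might be derived and applied is sketched below. This is an
illustrative sketch under stated assumptions: each event is represented as a (forward scatter, side
scatter) pair in arbitrary units, and the standard is taken as the mean plus or minus k standard
deviations of a live reference population. The choice of k = 3 and all function names are
assumptions introduced here, not part of the specification.

```python
# Illustrative sketch: build a forward/side scatter standard from a live
# reference population and test later events against it.
from statistics import mean, stdev


def scatter_standard(live_events, k=3.0):
    """Return ((fsc_low, fsc_high), (ssc_low, ssc_high)) from live reference events."""
    def bounds(values):
        m, s = mean(values), stdev(values)
        return (m - k * s, m + k * s)

    forward = [event[0] for event in live_events]
    side = [event[1] for event in live_events]
    return bounds(forward), bounds(side)


def within_standard(event, standard):
    """True if the event's forward and side scatter both fall inside the standard."""
    (f_low, f_high), (s_low, s_high) = standard
    forward, side = event
    return f_low <= forward <= f_high and s_low <= side <= s_high


if __name__ == "__main__":
    # Hypothetical reference events in arbitrary units.
    live = [(100, 50), (105, 52), (95, 48), (102, 51), (98, 49)]
    standard = scatter_standard(live)
    print(within_standard((101, 50), standard))
    print(within_standard((40, 120), standard))
```

Events falling outside the standard in either dimension would be candidates for exclusion as
non-viable.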

In a preferred embodiment, the viability assay utilizes a viability dye. There are a number of known
viability dyes that stain dead or dying cells, but do not stain growing cells. For example, annexin V is a
member of a protein family which displays specific binding to phospholipid (phosphatidylserine) in a
divalent ion dependent manner. This protein has been widely used for the measurement of apoptosis
(programmed cell death) as cell surface exposure of phosphatidylserine is a hallmark early signal of
this process. Suitable viability dyes include, but are not limited to, annexin, ethidium homodimer-1,
DEAD Red, propidium iodide, SYTOX Green, etc., and others known in the art; see the Molecular
Probes Handbook of Fluorescent Probes and Research Chemicals, Haugland, Sixth Edition, hereby
incorporated by reference; see Apoptosis Assay on page 285 in particular, and Chapter 16.

Protocols for viability dye staining for cell viability are known, see Molecular Probes catalog, supra. In
this embodiment, the viability dye such as annexin is labeled, either directly or indirectly, and
combined with a cell population. Annexin is commercially available, i.e., from PharMingen, San Diego,
California or Caltag Laboratories, Millbrae, California. Preferably, the viability dye is provided in a
solution wherein the dye is in a concentration of about 100 ng/ml to about 500 ng/ml, more preferably,
about 500 ng/ml to about 1 μg/ml, and most preferably, from about 1 μg/ml to about 5 μg/ml. In a
preferred embodiment, the viability dye is directly labeled; for example, annexin may be labeled with a
fluorochrome such as fluorescein isothiocyanate (FITC), Alexa dyes, TRITC, AMCA, APC, tri-color,
Cy-5, and others known in the art or commercially available. In an alternate preferred embodiment, the
viability dye is labeled with a first label, such as a hapten such as biotin, and a secondary fluorescent
label is used, such as fluorescent streptavidin. Other first and second labeling pairs can be used as
will be appreciated by those in the art.

Once added, the viability dye is allowed to incubate with the cells for a period of time, and washed, if
necessary. The cells are then sorted as outlined below to remove the non-viable cells.

In a preferred embodiment, exclusion dye staining is used as the viability assay. Exclusion dyes are
those which are excluded from living cells, i.e. they are not taken up passively (they do not permeate
the cell membrane of a live cell). However, due to the permeability of dead or dying cells, they are
taken up by dead cells. Generally, but not always, the exclusion dyes bind to DNA, for example via
intercalation. Preferably, the exclusion dye does not fluoresce, or fluoresces poorly, in the absence of
DNA; this eliminates the need for a wash step. Alternatively, exclusion dyes that require the use of a
secondary label may also be used. Preferred exclusion dyes include, but are not limited to, ethidium
bromide; ethidium homodimer-1; propidium iodide; SYTOX green nucleic acid stain; Calcein AM,
BCECF AM; fluorescein diacetate; TOTO® and TO-PRO™ (from Molecular Probes; supra, see
chapter 16) and others known in the art.

Protocols for exclusion dye staining for cell viability are known, see the Molecular Probes catalog,
supra. In general, the exclusion dye is added to the cells at a concentration of from about 100 ng/ml to
about 500 ng/ml, more preferably, about 500 ng/ml to about 1 μg/ml, and most preferably, from about
0.1 μg/ml to about 5 μg/ml, with about 0.5 μg/ml being particularly preferred. The cells and the
exclusion dye are incubated for some period of time, washed, if necessary, and then the cells sorted
as outlined below, to remove non-viable cells from the population.

In addition, there are other cell viability assays which may be run, including for example enzymatic
assays, which can measure extracellular enzymatic activity of either live cells (i.e. secreted proteases,
etc.), or dead cells (i.e. the presence of intracellular enzymes in the media; for example, intracellular
proteases, mitochondrial enzymes, etc.). See the Molecular Probes Handbook of Fluorescent Probes
and Research Chemicals, Haugland, Sixth Edition, hereby incorporated by reference; see chapter 16
in particular.

In a preferred embodiment, at least one cell viability assay is run, with at least two different cell viability
assays being preferred, when the fluors are compatible. When only 1 viability assay is run, a preferred
embodiment utilizes light scattering assays (both forward and side scattering). When two viability
assays are run, preferred embodiments utilize light scattering and dye exclusion, with light scattering
and viability dye staining also possible, and all three being done in some cases as well. Viability
assays thus allow the separation of viable cells from non-viable or dying cells.

In addition to a cell viability assay, a preferred embodiment utilizes a cell proliferation assay. By 
"proliferation assay" herein is meant an assay that allows the determination that a cell population is 
either proliferating, i.e. replicating, or not replicating. 

In a preferred embodiment, the proliferation assay is a dye inclusion assay. A dye inclusion assay
relies on dilution effects to distinguish between cell phases. Briefly, a dye (generally a fluorescent dye
as outlined below) is introduced to cells and taken up by the cells. Once taken up, the dye is trapped in
the cell, and does not diffuse out. As the cell population divides, the dye is proportionally diluted. That
is, after the introduction of the inclusion dye, the cells are allowed to incubate for some period of time;
cells that lose fluorescence over time are dividing, and the cells that remain fluorescent are arrested in
a non-growth phase.
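
The dilution effect can be made concrete with a simple idealized model. The sketch below assumes
that the trapped dye is split equally between daughter cells, so that per-cell fluorescence halves
with each division, and that no dye is lost by other routes. Both are simplifying assumptions made
here for illustration, and the function name is hypothetical.

```python
# Illustrative sketch: estimate how many divisions a cell has undergone from
# the loss of inclusion dye, assuming fluorescence halves at each division.
import math


def estimated_divisions(initial_fluorescence: float,
                        current_fluorescence: float) -> float:
    """Return log2(initial/current), floored at zero for cells that have not dimmed."""
    if initial_fluorescence <= 0 or current_fluorescence <= 0:
        raise ValueError("fluorescence values must be positive")
    return max(0.0, math.log2(initial_fluorescence / current_fluorescence))


if __name__ == "__main__":
    # Hypothetical readings in arbitrary fluorescence units.
    print(estimated_divisions(1000.0, 125.0))
```

Under this model a cell that retains its initial fluorescence is scored as arrested, while one at an
eighth of its initial fluorescence is scored as having divided about three times.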

Generally, the introduction of the inclusion dye may be done in one of two ways. Either the dye cannot
passively enter the cells (e.g. it is charged), and the cells must be treated to take up the dye, for
example through the use of an electric pulse. Alternatively, the dye can passively enter the cells, but
once taken up, it is modified such that it cannot diffuse out of the cells. For example, enzymatic
modification of the inclusion dye may render it charged, and thus unable to diffuse out of the cells. For
example, the Molecular Probes CellTracker™ dyes are fluorescent chloromethyl derivatives that freely
diffuse into cells, and then a glutathione S-transferase-mediated reaction produces membrane
impermeant dyes.

Suitable inclusion dyes include, but are not limited to, the Molecular Probes line of CellTracker™ dyes,
including, but not limited to CellTracker™ Blue, CellTracker™ Yellow-Green, CellTracker™ Green,
CellTracker™ Orange, PKH26 (Sigma), and others known in the art; see the Molecular Probes
Handbook, supra; chapter 15 in particular.

In general, inclusion dyes are provided to the cells at a concentration ranging from about
100 ng/ml to about 5 μg/ml, with from about 500 ng/ml to about 1 μg/ml being preferred. A wash step
may or may not be used. In a preferred embodiment, a candidate bioactive agent is combined with the
cells as described herein. The cells and the inclusion dye are incubated for some period of time, to
allow cell division and thus dye dilution. The length of time will depend on the cell cycle time for the
particular cells; in general, at least about 2 cell divisions are preferred, with at least about 3 being
particularly preferred and at least about 4 being especially preferred. The cells are then sorted as
outlined below, to create populations of cells that are replicating and those that are not. As will be
appreciated by those in the art, in some cases, for example when screening for anti-proliferation
agents, the bright (i.e. fluorescent) cells are collected; in other embodiments, for example for
screening for proliferation agents, the low fluorescence cells are collected. Alterations are determined
by measuring the fluorescence at either different time points or in different cell populations, and
comparing the determinations to one another or to standards.
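
The choice of which sorted population to collect can be sketched as follows. This is a hypothetical
illustration: the cell identifiers, the fluorescence threshold, and the screen labels are
assumptions introduced here, and a real sort would be gated on the instrument rather than in code.

```python
# Illustrative sketch: split cells by retained inclusion dye and pick the
# population to collect for a given screen. Names and values are hypothetical.
def sort_by_dye_retention(cells: dict[str, float], threshold: float):
    """Return (bright, dim) sets of cell identifiers split at the threshold."""
    bright = {cell_id for cell_id, signal in cells.items() if signal >= threshold}
    dim = set(cells) - bright
    return bright, dim


def collect(cells: dict[str, float], threshold: float, screen: str) -> set[str]:
    """Collect bright cells for anti-proliferation screens, dim cells otherwise."""
    bright, dim = sort_by_dye_retention(cells, threshold)
    if screen == "anti-proliferation":
        return bright  # dye retained: cells did not divide
    if screen == "proliferation":
        return dim     # dye diluted: cells divided
    raise ValueError(f"unknown screen type: {screen!r}")


if __name__ == "__main__":
    readings = {"c1": 950.0, "c2": 110.0, "c3": 870.0, "c4": 60.0}
    print(sorted(collect(readings, threshold=500.0, screen="anti-proliferation")))
```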

In a preferred embodiment, the proliferation assay is an antimetabolite assay. In general,
antimetabolite assays find the most use when agents that cause cellular arrest in G1 or G2 resting
phase are desired. In an antimetabolite proliferation assay, the use of a toxic antimetabolite that will kill
dividing cells will result in survival of only those cells that are not dividing. Suitable antimetabolites
include, but are not limited to, standard chemotherapeutic agents such as methotrexate, cisplatin,
taxol, hydroxyurea, nucleotide analogs such as AraC, etc. In addition, antimetabolite assays may
include the use of genes that cause cell death upon expression.

The concentration at which the antimetabolite is added will depend on the toxicity of the particular 
antimetabolite, and will be determined as is known in the art. The antimetabolite is added and the 
cells are generally incubated for some period of time; again, the exact period of time will depend on
the characteristics and identity of the antimetabolite as well as the cell cycle time of the particular cell
population. Generally, a time sufficient for at least one cell division to occur is used.

In a preferred embodiment, at least one proliferation assay is run, with more than one being preferred.
Thus, a proliferation assay results in a population of proliferating cells and a population of arrested
cells.

In a preferred embodiment, either after or simultaneously with one or more of the proliferation assays 
outlined above, at least one cell phase assay is done. A "cell phase" assay determines at which cell 
phase the cells are arrested: M, G1, S, or G2.

In a preferred embodiment, the cell phase assay is a DNA binding dye assay. Briefly, a DNA binding
dye is introduced to the cells, and taken up passively. Once inside the cell, the DNA binding dye binds
to DNA, generally by intercalation, although in some cases, the dyes can be either major or minor
groove binding compounds. The amount of dye is thus directly correlated to the amount of DNA in the
cell, which varies by cell phase; G2 and M phase cells have twice the DNA content of G1 phase cells,
and S phase cells have an intermediate amount, depending on at what point in S phase the cells are.
Suitable DNA binding dyes are permeant, and include, but are not limited to, Hoechst 33342 and
33258, acridine orange, 7-AAD, LDS 751, DAPI, and SYTO 16 (Molecular Probes Handbook, supra;
chapters 8 and 16 in particular).

In general, the DNA binding dyes are added in concentrations ranging from about 1 µg/ml to about 5
µg/ml. The dyes are added to the cells and allowed to incubate for some period of time; the length of
time will depend in part on the dye chosen. In one embodiment, measurements are taken
immediately after addition of the dye. The cells are then sorted as outlined below, to create
populations of cells that contain different amounts of dye, and thus different amounts of DNA; in this
way, cells that are replicating are separated from those that are not. As will be appreciated by those in
the art, in some cases, for example when screening for anti-proliferation agents, cells with the least
fluorescence (and thus a single copy of the genome) can be separated from those that are replicating
and thus contain more than a single genome of DNA. Alterations are determined by measuring the
fluorescence at either different time points or in different cell populations, and comparing the
determinations to one another or to standards.
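
The relationship between dye fluorescence and cell phase described above can be illustrated with a short sketch. It is not part of the disclosed method; it assumes the fluorescence of the G1 (single genome) peak is already known, and the function names and the `tolerance` window are hypothetical values chosen only for illustration. DNA content alone cannot distinguish G2 from M, so those phases are reported together.

```python
def classify_phase(intensity, g1_peak, tolerance=0.15):
    """Assign a coarse cell phase from DNA-dye fluorescence.

    intensity -- dye fluorescence of one cell (arbitrary units)
    g1_peak   -- fluorescence of the G1 (single genome) peak, assumed known
    tolerance -- hypothetical fractional window allowed around each peak
    """
    ratio = intensity / g1_peak
    if ratio < 1.0 - tolerance:
        return "sub-G1"          # less than one genome of DNA
    if ratio <= 1.0 + tolerance:
        return "G1"              # single genome copy
    if ratio < 2.0 - tolerance:
        return "S"               # intermediate DNA content
    if ratio <= 2.0 + tolerance:
        return "G2/M"            # twice the G1 DNA content
    return "above G2/M"


def split_by_dna_content(intensities, g1_peak, tolerance=0.15):
    """Group per-cell intensities into populations keyed by phase label."""
    groups = {}
    for value in intensities:
        label = classify_phase(value, g1_peak, tolerance)
        groups.setdefault(label, []).append(value)
    return groups
```

For an anti-proliferation screen, the population of interest in this sketch would be the cells labelled "G1", i.e. those with the least fluorescence.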

In a preferred embodiment, the cell phase assay is a cyclin destruction assay. In this embodiment,
prior to screening (and generally prior to the introduction of a candidate bioactive agent, as outlined
below), a fusion nucleic acid is introduced to the cells. The fusion nucleic acid comprises nucleic acid
encoding a cyclin destruction box and a nucleic acid encoding a detectable molecule. "Cyclin
destruction boxes" are known in the art and are sequences that cause destruction via the
ubiquitination pathway of proteins containing the boxes during particular cell phases. That is, for
example, G1 cyclins may be stable during G1 phase but degraded during S phase due to the
presence of a G1 cyclin destruction box. Thus, by linking a cyclin destruction box to a detectable
molecule, for example green fluorescent protein, the presence or absence of the detectable molecule
can serve to identify the cell phase of the cell population. In a preferred embodiment, multiple boxes
are used, preferably each with a different fluor, such that detection of the cell phase can occur.

A number of cyclin destruction boxes are known in the art; for example, cyclin A has a destruction box
comprising the sequence RTVLGVIGD; the destruction box of cyclin B1 comprises the sequence
RTALGDIGN. See Glotzer et al., Nature 349:132-138 (1991). Other destruction boxes are known as
well: YMTVSIIDRFMQDSCVPKKMLQLVGVT (rat cyclin B); KFRLLQETMYMTVSIIDRFMQNSCVPKK
(mouse cyclin B); RAILIDWLIQVQMKFRLLQETMYMTVS (mouse cyclin B1);
DRFLQAQLVCRKKLQWGITALLLASK (mouse cyclin B2); and MSVLRGKLQLVGTAAMLL (mouse
cyclin A2).
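
As an illustration only, the destruction box sequences listed above can be collected into a lookup table and used to check which box a designed fusion protein sequence carries. The sequences are copied as printed in this text and should be verified against the cited references before use; the function name and the example fusion sequence are hypothetical.

```python
# Destruction box sequences exactly as listed in the text above.
DESTRUCTION_BOXES = {
    "cyclin A": "RTVLGVIGD",
    "cyclin B1": "RTALGDIGN",
    "rat cyclin B": "YMTVSIIDRFMQDSCVPKKMLQLVGVT",
    "mouse cyclin B": "KFRLLQETMYMTVSIIDRFMQNSCVPKK",
    "mouse cyclin B1": "RAILIDWLIQVQMKFRLLQETMYMTVS",
    "mouse cyclin B2": "DRFLQAQLVCRKKLQWGITALLLASK",
    "mouse cyclin A2": "MSVLRGKLQLVGTAAMLL",
}


def find_destruction_boxes(protein):
    """Return (name, start index) for each listed box found in a protein sequence."""
    protein = protein.upper()
    hits = []
    for name, box in DESTRUCTION_BOXES.items():
        position = protein.find(box)
        if position != -1:
            hits.append((name, position))
    return hits


# Hypothetical fusion: a short leader, the cyclin B1 box, then a linker.
example_fusion = "MAAA" + "RTALGDIGN" + "GGSG"
hits = find_destruction_boxes(example_fusion)
```

A plain substring search is used here because the boxes are given as exact sequences; a search tolerant of variant residues would need a different approach.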

The nucleic acid encoding the cyclin destruction box is operably linked to nucleic acid encoding a
detectable molecule. The fusion proteins are constructed by methods known in the art. For example,
the nucleic acid encoding the destruction box is ligated to a nucleic acid encoding a p- or rGFP
protein.

Accordingly, sorting after cell phase assays generally results in at least two populations of
cells that are in different cell phases.

In a preferred embodiment, the methods are used to screen candidate bioactive agents for the ability
to modulate cell cycle regulation, including the activation or suppression of cell cycle checkpoint
pathways and ameliorating checkpoint defects. The candidate bioactive agent can be added to the
cell population exogenously or can be introduced into the cells as described further herein.

As above, when the candidate agents are nucleic acids or peptides, fusion partners, as defined
herein, may be used. The fusion partners, including presentation structures, may be modified,
randomized, and/or matured to alter the presentation orientation of the randomized expression
product. For example, determinants at the base of the loop may be modified to slightly modify the
internal loop peptide tertiary structure, while maintaining the randomized amino acid sequence.

In a preferred embodiment, combinations of fusion partners are used. Thus, for example, any number
of combinations of presentation structures, targeting sequences, rescue sequences, and stability
sequences may be used, with or without linker sequences.

Thus, candidate agents can include these components, and may then be used to generate a library of 
fragments, each containing a different random nucleotide sequence that may encode a different 
peptide. The ligation products are then transformed into bacteria, such as E. coli, and DNA is
prepared from the resulting library, as is generally outlined in Kitamura, PNAS USA 92:9146-9150
(1995), hereby expressly incorporated by reference. 

Delivery of the library DNA into a retroviral packaging system results in conversion to infectious virus.
Suitable retroviral packaging system cell lines include, but are not limited to, the Bing and BOSC23
cell lines described in WO 94/19478; Soneoka et al., Nucleic Acid Res. 23(4):628 (1995); Finer et al.,
Blood 83:43 (1994); Phoenix packaging lines such as PhiNX-eco and PhiNX-ampho, described below;
292T + gag-pol and retrovirus envelope; PA317; and cell lines outlined in Markowitz et al., Virology
167:400 (1988), Markowitz et al., J. Virol. 62:1120 (1988), Li et al., PNAS USA 93:11658 (1996),
Kinsella et al., Human Gene Therapy 7:1405 (1996), all of which are incorporated by reference.
Preferred systems include PhiNX-eco and PhiNX-ampho or similar cell lines, disclosed in PCT
US97/01019.

When the cells are not replicating, other viral vectors may be used, including adenoviral vectors, feline
immunodeficiency virus (FIV) vectors, etc. Thus, in a preferred embodiment, adenoviral vectors comprising a p-
or rGFP gene are provided. Similarly, FIV vectors comprising a p- or rGFP gene are provided.

In a preferred embodiment, when the candidate agent is introduced to the cells using a viral vector,
the candidate peptide agent is linked to a p- or rGFP gene, and the methods of the invention include
at least one expression assay. An expression assay is an assay that allows the determination of
whether a candidate bioactive agent has been expressed, i.e. whether a candidate peptide agent is
present in the cell. Thus, by linking the expression of a candidate agent to the expression of p- or
rGFP protein, the presence or absence of the candidate peptide agent may be determined.
Accordingly, in this embodiment, the candidate agent is operably linked to a detectable molecule.
Generally, this is done by creating a fusion nucleic acid. The fusion nucleic acid comprises a first
nucleic acid encoding the candidate bioactive agent (which can include fusion partners, as outlined
above), and a second nucleic acid encoding a detectable molecule. The terms "first" and "second" are
not meant to confer an orientation of the sequences with respect to 5'-3' orientation of the fusion
nucleic acid. For example, assuming a 5'-3' orientation of the fusion sequence, the first nucleic acid
may be located either 5' to the second nucleic acid, or 3' to the second nucleic acid. Preferred
detectable molecules in this embodiment include, but are not limited to, p- or rGFP and pGFP.

In general, the candidate agents are added to the cells (either extracellularly or intracellularly, as
outlined above) under reaction conditions that favor agent-target interactions. Generally, this will be
physiological conditions. Incubations may be performed at any temperature which facilitates optimal
activity, typically between 4 and 40°C. Incubation periods are selected for optimum activity, but may
also be optimized to facilitate rapid high-throughput screening. Typically between 0.1 and 1 hour will
be sufficient. Excess reagent is generally removed or washed away.

A variety of other reagents may be included in the assays. These include reagents like salts, neutral
proteins, e.g. albumin, detergents, etc., which may be used to facilitate optimal protein-protein binding
and/or reduce non-specific or background interactions. Also reagents that otherwise improve the
efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc.,
may be used. The mixture of components may be added in any order that provides for detection.
Washing or rinsing the cells will be done as will be appreciated by those in the art at different times,
and may include the use of filtration and centrifugation. When second labeling moieties (also referred
to herein as "secondary labels") are used, they are preferably added after excess non-bound target
molecules are removed, in order to reduce non-specific binding; however, under some circumstances,
all the components may be added simultaneously.

In a preferred embodiment, the cells are sorted using fluorescence-activated cell sorting (FACS). In the
invention herein, cell cycle regulation is evaluated by multiple parameters, which results in reduced
background and greater specificity. In contrast, FACS has been used in the past to evaluate two
different or unrelated characteristics at the same time, which identifies cells having those two
characteristics, but does not reduce the background for the combined characteristics.

Thus, the cells are sorted or enriched in a FACS on the basis of one or more of the assays, including a
cell viability assay, a proliferation assay, a cell phase assay, and (when candidate agents are
expressed with detectable moieties) an expression assay. The results from one or more of these
assays are compared to cells that were not exposed to the candidate bioactive agent, or to the same
cells prior to introduction of the candidate agent. Alterations in these results can indicate that said
agent modulates cell cycle regulation.

A strength of the present invention is that a library of candidate agents may be tested in a library of
cells, because the present methods allow single cell sorting, with extremely high specificity, such that
very rare events may be detected. The use of multiple laser paths allows sort accuracy of 1 in 10^ with
better than 70% accuracy.

In addition, the present invention can, in addition to the identification of multiple cell cycle regulation
properties, be combined with the identification of other cellular characteristics. For example,
parameters of general cellular health can be determined and selected for by using, i.e., the dye Indo-1
indicating a calcium response. Other cellular parameters which are routinely identified by the skilled
artisan include but are not limited to: cell size, cell shape, redox state, DNA content, nucleic acid
sequence, chromatin structure, RNA content, total protein, antigens, lipids, surface proteins,
intracellular receptors, oxidative metabolism, DNA synthesis and degradation and intracellular pH.

In a preferred embodiment, each of the measurements is determined simultaneously from an
individual cell as it passes through the beam paths of multiple lasers. Alternatively, the measurements
are done sequentially. By using more than one parameter to detect cell cycle regulation or alterations
in cell cycle regulation, background is reduced and specificity is increased. The cells meeting the
parameters of the desired properties can be physically sorted from cells not meeting the desired
parameters or they can be identified by their percentage in the cell population.
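
The way several parameters combine to reduce background can be sketched as a simple gate: a cell is selected only if every measured parameter falls inside its window, and the selected cells can also be reported as a percentage of the population. This is a minimal sketch, not the sorter's actual logic; the parameter names ("dna", "gfp") and the gate limits are hypothetical.

```python
def passes_all_gates(cell, gates):
    """True when every gated parameter of one cell lies inside its (low, high) window."""
    return all(low <= cell[name] <= high for name, (low, high) in gates.items())


def gate_population(cells, gates):
    """Return the cells meeting all gates and their percentage in the population."""
    selected = [cell for cell in cells if passes_all_gates(cell, gates)]
    percent = 100.0 * len(selected) / len(cells) if cells else 0.0
    return selected, percent


# Hypothetical measurements: DNA content relative to the G1 peak, and GFP signal.
cells = [
    {"dna": 1.0, "gfp": 500},   # G1 DNA content, expresses the GFP-linked agent
    {"dna": 2.0, "gfp": 500},   # expresses the agent but is not arrested in G1
    {"dna": 1.0, "gfp": 10},    # G1 DNA content but no expression
    {"dna": 2.0, "gfp": 5},     # neither
]
gates = {"dna": (0.85, 1.15), "gfp": (100, float("inf"))}
selected, percent = gate_population(cells, gates)
```

With either gate alone, two of the four example cells would be selected; requiring both leaves one, which is the sense in which additional parameters lower the background.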

In general, Kds of < 1 µM are preferred, to allow for retention of binding in the presence of the shear
forces present in FACS sorting. In a preferred embodiment, the cells are sorted at very high speeds,
for example greater than about 5,000 sorting events per sec, with greater than about 10,000 sorting
events per sec being preferred, and greater than about 25,000 sorting events per second being
particularly preferred, with speeds of greater than about 50,000 to 100,000 being especially preferred.
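
To give a sense of what these sorting speeds mean in practice, the following sketch computes how long a population of a given size takes to pass through the sorter at each of the rates named above. The rates come from the text; the population size in the example is hypothetical, and the calculation ignores dead time and aborted events.

```python
# Sorting rates named in the text, in sorting events per second.
RATES_PER_SECOND = (5_000, 10_000, 25_000, 50_000, 100_000)


def sort_time_seconds(n_cells, events_per_second):
    """Time in seconds for n_cells to pass the sorter at a constant event rate."""
    if events_per_second <= 0:
        raise ValueError("events_per_second must be positive")
    return n_cells / events_per_second


def sort_time_table(n_cells):
    """Map each rate to the sort time in minutes for a population of n_cells."""
    return {rate: sort_time_seconds(n_cells, rate) / 60.0 for rate in RATES_PER_SECOND}


# Hypothetical library of six million cells.
table = sort_time_table(6_000_000)
```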

Cells processed for stimulation and staining are generally taken up in buffer and filtered prior to
cytometry. Cells can be analyzed using a FACSCAN (Becton Dickinson Inc., laser line 488nm) or a
Mo-Flo (Cytomation, Inc., laser lines 350nm broadband (UV), 488nm, and 647nm) Cytometer. Cells
are sorted, if desired, using the Mo-Flo.

Where the cells are analyzed by microscopy, cells post stimulation or staining are generally mounted
onto glass slides and coverslipped; these are directly visualized by brightfield and fluorescence
microscopy on an inverted microscope (i.e., TE300, Nikon) using standard filter sets. Images can also
be obtained using an inverted confocal scanning microscope (Zeiss, Inc., Bio-Rad, Inc.) using
standard filter sets.

The sorting results in a population of cells having the desired properties. In a preferred embodiment, 
the parameters are set to identify at least one candidate bioactive agent that modulates cell cycle 
regulation.

In a preferred embodiment, the bioactive agent is characterized. This will proceed as will be
appreciated by those in the art, and generally includes an analysis of the structure, identity, binding
affinity and function of the agent. Generally, once identified, the bioactive agent is resynthesized and
combined with the target cell to verify the cell cycle regulation modulation under various conditions
and in the presence or absence of other various agents. The bioactive agent can be prepared in a
therapeutically effective amount to modulate cell cycle regulation and combined with a suitable
pharmaceutical carrier.

In a preferred embodiment, the cell populations can be subjected to various experimental conditions,
with and without the candidate agents. Changes in conditions include but are not limited to changes in
pH, temperature, buffer or salt concentration, etc. In a preferred embodiment, the pH is changed,
generally by increasing or decreasing the pH, usually by from about 0.5 to about 3 pH units.
Alternatively, the temperature is altered, with increases or decreases of from about 5°C to about 30°C
being preferred. Similarly, the salt concentration may be modified, with increases or decreases of
from about 0.1 M to about 2 M being preferred.

It is understood by the skilled artisan that the steps of the assays provided herein can vary in order. It
is also understood, however, that while various options (of compounds, properties selected or order of
steps) are provided herein, the options are also each provided individually, and can each be
individually segregated from the other options provided herein. Moreover, steps which are obvious
and known in the art that will increase the sensitivity of the assay are intended to be within the scope
of this invention. For example, there may be additional washing steps, or segregation or isolation
steps. Moreover, it is understood that in some cases detection is in the cells, but can also take place
in the media, or vice versa.

In a preferred embodiment, the cellular phenotype is exocytosis, and the methods and compositions of
the invention are directed to the detection of alterations in exocytosis, again using a FACS machine.
There are a number of parameters that may be evaluated or assayed to allow the detection of
alterations in exocytotic pathways, including, but not limited to, light scattering, fluorescent dye uptake,
fluorescent dye release, granule exposure, surface granule enzyme activity, and the quantity of
granule specific proteins. By assaying or measuring one or more of these parameters, it is possible to
detect not only alterations in exocytosis, but alterations of different steps of the exocytotic pathway. In
addition, multiparameter analysis also reduces the background, or "false positives", that are detected.
In this manner, rapid, accurate screening of candidate agents may be performed to identify agents that
modulate exocytosis.

In a preferred embodiment, the invention provides methods for screening for alterations in exocytosis
of a population of cells. By "alteration" or "modulation" in the context of exocytosis is meant a
decrease or an increase in the amount of exocytosis in one cell compared to another cell or in the
same cell under different conditions. The measurements can be determined wherein all of the
conditions are the same for each measurement, or under various conditions, with or without bioactive
agents, or at different stages of the exocytic process. For example, a measurement of exocytosis can
be determined in a cell population wherein a candidate bioactive agent is present and wherein the
candidate bioactive agent is absent. In another example, the measurements of exocytosis are
determined wherein the condition or environment of the populations of cells differ from one another.
For example, the cells may be evaluated in the presence or absence of physiological signals, such as
exocytic inducers (i.e. Ca2+, ionomycin, etc.), hormones, antibodies, peptides, antigens, cytokines,
growth factors, action potentials, or other cells (i.e. cell-cell contacts). In another example, the
measurements of exocytosis are determined at different stages of the exocytic process. In yet
another example, the measurements of exocytosis are taken wherein the conditions are the same,
and the alterations are between one cell or cell population and another cell or cell population.

By a "population of cells" herein is meant a sample of cells as defined above. In this embodiment, the
cells are preferably (but are not required to be) rapidly growing, retrovirally infectable, and compatible with
dyes and antibodies. Preferred cell types for use in this embodiment include, but are not limited to,
mast cells, neurons, adrenal chromaffin cells, basophils, endocrine cells including pancreatic β-cells,
pancreatic acinar cells including exocrine cells, neutrophils, monocytes, lymphocytes, mammary cells,
sperm, egg cells and PMN leukocytes, endothelial cells, adipocytes, and muscle cells.

The exocytotic methods comprise sorting the cells in a FACS machine by assaying for alterations in at
least three of the properties selected from the group consisting of light scattering, fluorescent dye
uptake, fluorescent dye release, granule exposure, surface granule enzyme activity, and the quantity
of granule specific proteins. In a preferred embodiment, each of the measurements is determined
simultaneously from an individual cell as it passes through the beam paths of multiple lasers.
Alternatively, the measurements are done sequentially. By using more than one parameter to detect
exocytosis or alterations in exocytosis, background is reduced and specificity is increased. The cells
meeting the parameters of the desired properties can be physically sorted from cells not meeting the
desired parameters or they can be identified by their percentage in the cell population.

In a preferred embodiment, changes in light scattering are assayed to determine alterations in
exocytosis in a population of cells. When viewed in the FACS, cells have particular characteristics as
measured by their forward and 90 degree (side) light scatter properties. These scatter properties
represent the size, shape and granule content of the cells. Upon activation of the cells with a
pro-exocytic stimulus, both the forward and side scatter properties of the cells change considerably.
These properties account for two parameters to be measured as a readout for the exocytic event.
These properties change in proportion to the extent of exocytosis of the cells and depend on the time
course of the exocytic events as well. Alterations in the intensity of light scattering, or the
cell-refractive index, indicate alterations in exocytosis either in the same cell at different times, or compared
to the same cell under different conditions or with candidate bioactive agents present or absent, or
compared to different cells or cell populations.

In one embodiment provided herein, a cell population is combined with an agent which is known to
stimulate exocytosis and the light scattering properties are determined. Cells having light scattering
properties indicating the desirable exocytic activity can be identified and/or sorted. Exocytic activity as
used herein includes lack of activity. In a preferred embodiment, candidate bioactive agents are
combined with the cell population prior to or with the exocytic stimulus, as is more fully outlined below.
In this embodiment, where light scattering properties differ as between a) a cell population combined
with a known exocytic stimulus and a candidate bioactive agent, and b) a cell population combined
with a known exocytic stimulus wherein the candidate bioactive agent is absent, it can be determined
that the candidate bioactive agent modulates exocytosis. It may also be desirable in some cases to
include an inhibitor of exocytosis or to exclude the exocytic stimulus to identify bioactive agents which
induce exocytosis. Preferably, light scattering properties are measured in combination with at least
one, and preferably two, other properties which indicate exocytosis activity. General methodologies for
light scattering measurements are further described in Perretti et al., J. Pharmacol. Methods,
23(3):187-194 (1990) and Hide et al., J. Cell Biol., 123(3):585-593 (1993), both incorporated herein by
reference. In general, changes of at least about 5% from baseline are preferred, with at least about
25% being more preferred, at least about 50% being particularly preferred, and at least about 75 to
100% being especially preferred. Baseline in this case generally means the light scatter properties of
the cells prior to exocytotic stimulation. In each case provided herein, the baseline may also be set for
any control parameter. For example, the baseline may be set at the exocytosis measurement of a
particular cell, a similar cell under different conditions, or at a particular time point during exocytosis.

In another preferred embodiment, changes in fluorescent dye uptake are evaluated. Preferred
fluorescent dyes include styryl dyes, which indicate exocytosis activity in relation to endocytosis,
sometimes referred to as coupled endocytosis. The theory behind coupled endocytosis is that cells
undergoing exocytosis must also undergo endocytosis in order to maintain cell volume and membrane
integrity. Thus, upon exocytic stimulation, endocytosis is also increased, providing an indirect
measurement of exocytosis by quantifying the amount of styryl dye uptake.

In an embodiment provided herein, the cells are bathed in a solution of styryl dye and stimulated with a
pro-exocytic stimulus and the dye is quantitated. Preferably, after exocytic stimulation, the cells are
spun down, aspirated and resuspended in fresh buffer. In a preferred embodiment, a candidate
bioactive agent is combined with the cells as described herein. In some cases, the candidate
bioactive agent can be combined with the cells with an inhibitor of exocytosis or without the
pro-exocytic stimulus. Preferably, a pro-exocytic stimulus is added to the cell population which results in a
dramatic increase in the fluorescence signal of the dye. The increased cell associated signal is due to
coupled endocytosis of the styryl dye and is proportional to the exocytic response in both time and
intensity. Conversely, the signal is not increased wherein exocytosis is inhibited or is not induced.

Alterations are determined by measuring the fluorescence at either different time points or in different
cell populations, and comparing the determinations to one another or to standards. In general,
changes of at least about 50% from baseline are preferred, with changes of at least about 75%-100%
being more preferred, changes of at least about 250% being particularly preferred, and changes of at
least about 1000-2000% being especially preferred. Baseline in this case means the styryl dye uptake
of cells prior to exocytic stimulation.

Preferred styryl dyes include, but are not limited to, FM1-43, FM4-64, FM14-68, FM2-10, FM4-84,
FM1-84, FM14-27, FM14-29, FM3-25, FM3-14, FM5-55, RH414, FM6-55, FM10-75, FM1-81, FM9-49,
FM4-95, FM4-59, FM9-40, and combinations thereof. Preferred dyes such as FM1-43 are only weakly
fluorescent in water but very fluorescent when associated with a membrane, such that dye uptake is
readily discernible. Suitable dyes are available commercially, i.e., Molecular Probes, Inc., of Eugene,
Oregon, "Handbook of Fluorescent Probes and Research Chemicals", 6th Edition, 1996, particularly
Chapter 17, and more particularly, Section 2 of Chapter 17 (including referenced related chapter),
hereby incorporated herein by reference. Preferably, the dyes are provided in a solution wherein the
dye concentration is about 25 to 1000-5000 nM, with from about 50 to about 1000 nM being preferred,
and from about 50 to 250 nM being particularly preferred. The use of styryl dyes is further described in
Betz, et al., Current Opinion in Neurobiology, 6:365-371 (1996), also incorporated herein by reference.
Preferably, fluorescent dye uptake is measured in combination with at least one, and preferably two,
other indicators of exocytosis activity.

In another preferred embodiment, changes in fluorescent dye release are evaluated. The present
invention is in part directed to the discovery that low pH concentration dyes, which are normally used
to stain lysosomes, also stain low pH exocytic granules. Generally, these dyes can be taken up by the
cells passively and concentrate in granules; however, the cells can be induced to take up the dye, i.e.,
by coupled endocytosis. In a preferred embodiment, a cell population is bathed in a low pH
concentration dye such that the dye is taken up into the cells. The cells are preferably washed. The
cells can be exposed to a pro-exocytic stimulus and/or inhibitor. In a preferred embodiment, a
candidate bioactive agent is combined with the cell population and preferably, the pro-exocytic
stimulus. Fluorescence is evaluated. Changes in fluorescent dye release between cells or at different
time points in the same cell indicate alterations in exocytosis. Preferably, the alterations are between
cells, and most preferably, between cells having different bioactive agents added thereto. Changes of
at least about 5% from baseline are preferred, with at least about 25% being more preferred, at least
about 50% being particularly preferred and at least about 100% being especially preferred. Baseline in
this case means the amount of dye in the cells prior to stimulation.

In this embodiment, low pH concentration dyes are preferred. Such low pH concentration dyes include
but are not limited to acridine orange, LYSOTRACKER™ red, LYSOTRACKER™ green, and
LYSOTRACKER™ blue. Such dyes are commercially available, i.e., from Molecular Probes, supra,
particularly including Chapter 17, Section 4 of Chapter 17, and referenced "related chapters", i.e.,
Chapter 23. In preferred embodiments, the dyes are administered in a solution wherein the dye is at a
concentration of about 50 nM to about 25 µM, with from about 5 µM to about 25 µM being preferred,
and from about 1 to 5 µM being particularly preferred. The use of low pH concentration dyes is
generally described (in regards to lysosome studies) in Haller, et al., Cell Calcium, 19(2):157-165
(1996), hereby incorporated herein by reference.

In an alternative embodiment wherein changes in fluorescent dye release are evaluated, the
fluorescence released into the supernatant is evaluated. In this embodiment, either styryl dyes, which
reversibly label endocytosed membranes, or low pH concentration dyes are used. In this embodiment,
a cell population is bathed in dye such that the dye is taken up into the cells passively or by induction.
The cells are then preferably washed. The cells can be exposed to a pro-exocytic stimulus and/or
inhibitor, and optionally, a candidate bioactive agent. The cells which are exposed to a pro-exocytic
stimulus will release the dye into the extracellular medium. The fluorescence in the medium can be
measured or detected. This process is sometimes referred to as destaining the cells. Optionally, an
agent for improving and facilitating the detection of the dye in the medium can be added. For
example, micelle-forming detergents such as 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate
(CHAPS) increase the fluorescence and thereby allow detection of small amounts of
exocytosis activity. Changes in the release of dye will indicate alterations in exocytosis in the same
cell, between cells, and most preferably, between cells having different bioactive agents added
thereto. In general, changes of at least about 5% from baseline are preferred, with at least about
25% being more preferred, with at least about 50% being particularly preferred and at least about
100% being especially preferred. Baseline in this case means the release of dye prior to exocytotic
stimulus. Preferably, dye release when measured in the media is combined with the evaluation of at
least one other exocytosis indicator.
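
The baseline comparisons given for the light scattering, dye uptake and dye release assays all follow the same pattern: a percent change from baseline is computed and compared against graded thresholds. The sketch below illustrates that pattern only. The threshold values are taken from the text, with the lower bound of each stated range used as the cutoff (an assumption made here for illustration); the dictionary keys and function names are hypothetical.

```python
TIERS = ("preferred", "more preferred", "particularly preferred", "especially preferred")

# Minimum percent change from baseline for each tier, per assay, as stated in the text.
# Where the text gives a range (e.g. 75 to 100%), the lower bound is assumed here.
THRESHOLDS = {
    "light_scattering": (5, 25, 50, 75),
    "dye_uptake": (50, 75, 250, 1000),
    "dye_release": (5, 25, 50, 100),
}


def percent_change(measured, baseline):
    """Magnitude of the change relative to baseline, in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return 100.0 * abs(measured - baseline) / abs(baseline)


def tier(assay, measured, baseline):
    """Return the highest tier reached by the change, or None if below all cutoffs."""
    change = percent_change(measured, baseline)
    label = None
    for name, cutoff in zip(TIERS, THRESHOLDS[assay]):
        if change >= cutoff:
            label = name
    return label
```

The same 30% change therefore rates differently by assay: `tier("light_scattering", 130, 100)` gives "more preferred", while `tier("dye_uptake", 130, 100)` gives `None`, reflecting the larger changes expected for styryl dye uptake.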

In a preferred embodiment, changes in granule exposure are determined. The granules are exposed
to the media during exocytosis, i.e., the granules fuse with the cell membrane and expose/release
their contents. Therefore, granule exposure is indicative of exocytic activity, and its absence is
indicative that exocytosis has not been induced, or has been inhibited. Preferably, granule exposure is
detected by a detectable agent which specifically binds to granules. An example of a detectable agent
used herein is annexin V, a member of a protein family which displays specific binding to phospholipid
(phosphatidylserine) in a divalent ion dependent manner. This protein has been widely used for the
measurement of apoptosis (programmed cell death) as cell surface exposure of phosphatidylserine is
a hallmark early signal of this process. Surprisingly, it has been determined herein that annexin V
specifically binds to exocytic granules when they are exposed at the cell surface during the secretory
process; granules internal to the cell are unlabeled. This property of annexin V is used herein to
create a single exocytosis assay based on its exocytosis dependent binding. Upon exocytic
stimulation of cells, the cells show an increase in annexin binding and fluorescent signal in proportion
in both time and intensity to the exocytic response.

In this embodiment, annexin is labelled, either directly or indirectly, and combined with a cell 
population. Annexin is commercially available, i.e., from PharMingen, San Diego, California, or Caltag
Laboratories, Millbrae, California. Preferably, the annexin is provided in a solution wherein the
annexin is in a concentration of about 100 ng/ml to about 600 ng/ml, more preferably, about 500 ng/ml
to about 1 µg/ml, and most preferably, from about 1 µg/ml to about 5 µg/ml. In a preferred
embodiment, the annexin is directly labelled; for example, annexin may be labelled with a
fluorochrome such as fluorescein isothiocyanate (FITC), Alexa dyes, TRITC, AMCA, APC, tri-color, Cy-5,
and others known in the art or commercially available. In an alternate preferred embodiment, the
annexin is labelled with a first label, such as a hapten such as biotin, and a secondary fluorescent
label is used, such as fluorescent streptavidin. Other first and second labelling pairs can be used as
will be appreciated by those in the art.

In the preferred embodiment, the cells are subjected to conditions that normally cause exocytosis. 
Optionally, a candidate bioactive agent is added to the cells. In some cases, it may be desirable to 
include an inhibitor of exocytosis to determine whether the candidate agent can reverse the inhibition,
or to add the candidate bioactive agent without an exocytic stimulus to determine whether the agent
induces exocytosis. The cells are preferably washed and fluorescence is detected in the microscope
or on the flow cytometer. Alterations in the detection of annexin binding indicate alterations in
exocytosis in the same cell, or between different cells, with or without the same conditions and/or agents
combined therewith. In general, changes of at least about 25% from baseline are preferred, with at
least about 50% being more preferred, at least about 100% being particularly preferred and at least
about 500% being especially preferred. Baseline in this case means the amount of annexin binding 
prior to exocytic stimulation. 
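
The comparison to baseline used in these assays can be written out explicitly. The following is a minimal sketch for illustration only: the function names and the example readings are hypothetical, and the cutoffs simply restate the approximate annexin binding thresholds recited above (25%, 50%, 100% and 500%).

```python
def percent_change(baseline: float, stimulated: float) -> float:
    """Return the change from baseline as a percentage of baseline."""
    if baseline <= 0:
        raise ValueError("baseline signal must be positive")
    return 100.0 * (stimulated - baseline) / baseline


# Cutoffs (percent) restating the annexin thresholds in the text, largest first.
ANNEXIN_TIERS = [
    (500.0, "especially preferred"),
    (100.0, "particularly preferred"),
    (50.0, "more preferred"),
    (25.0, "preferred"),
]


def classify_change(change_pct: float) -> str:
    """Map the magnitude of a percent change onto the tiers above."""
    magnitude = abs(change_pct)  # increases and decreases both count as changes
    for cutoff, label in ANNEXIN_TIERS:
        if magnitude >= cutoff:
            return label
    return "below preferred threshold"


# Hypothetical readings: annexin fluorescence before and after stimulation.
change = percent_change(baseline=200.0, stimulated=500.0)
print(change, classify_change(change))
```

The same calculation applies to the other indicators described herein, with the cutoffs replaced by the thresholds recited for each.
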

In another preferred embodiment, granule exposure is detected by a cationic dye such as berberine or 
ruthenium red. Such cationic dyes specifically stain secreting granules. Thus, when exocytosis
occurs, and secreting granules are exposed at the cell surface, an increase in fluorescence can be
detected. In a preferred embodiment, the cationic dye is combined with a cell population in the
presence or absence of an exocytic stimulus and/or inhibitor, and optionally, in the presence or
absence of a candidate bioactive agent. In a particularly preferred embodiment, the berberine is
combined with a cell and an exocytic stimulus and a candidate bioactive agent to determine whether
the candidate bioactive agent can modulate the exocytic activity. Preferably, the cells are washed and
then fluorescence is determined. In preferred embodiments, cationic dye evaluation is combined with
evaluation of at least one other indicator of exocytosis. The dye is combined with the cells as is known
in the art. General methodologies for using berberine are described in Berlin and Enerback, Int.
Arch. Allergy Appl. Immunol., 73(3):256-262 (1984), hereby incorporated by reference. In general,
changes of at least about 5% from baseline are preferred, with at least about 25% being more
preferred, at least about 50% being particularly preferred, and at least about 100% being especially
preferred. Baseline in this case means the amount of dye binding prior to stimulation. 

Similarly, Con A-FITC can be used, as it binds to the carbohydrate on granule proteins, in a manner
similar to those outlined herein.

In another preferred embodiment, changes in surface granule enzyme activity are determined.
Secretory granules contain enzymes such as proteases and glycosidases which are released as part
of the exocytic process. Frequently, these enzymes are inactive within the granule, due to the low pH,
but upon exposure to the extracellular media at physiological pH, they become activated. These
enzyme activities can be measured using chromogenic or fluorogenic substrates as components of
the extracellular media. This allows detection of exocytic cells in varying approaches. 

In one embodiment, sometimes called herein the population based enzyme assay, the generation of
signal via cleavage of a chromogenic or fluorogenic substrate can be quantified in the media. That is,
the amount of detectable reaction product in the media is related to the amount of enzyme present,
and thus to the amount of exocytosis. In this embodiment, it is the media, not the cells, that becomes
detectable. 

In a preferred embodiment, cells are subjected to an exocytic stimulus, and optionally, a candidate
bioactive agent. The chromogenic or fluorogenic substrate is added to the media, and changes in the 
signal are evaluated, as the enzymes cleave the extracellular substrates. 

In an alternate preferred embodiment, sometimes called herein "in situ enzymology assay",
fluorogenic substrates that precipitate upon cleavage are used. That is, upon exocytosis a
considerable amount of enzyme activity remains cell/granule associated and can be visualized using
fluorescent substrates which precipitate at the site of activity. For example, substrates for
glucuronidase, such as ELF-97 glucuronide, precipitate on exocytosing cells, but not resting cells, and
thus the cells can show increased fluorescence. The fluorescence is a direct measurement of
exocytosis and is pH dependent, reflecting the pH optima of the exocytosed enzyme. This method
also provides a method of distinguishing different subtypes of granules based on their enzyme profile.

In a preferred embodiment, the cell population is subjected to an exocytic stimulus and then incubated
with a detectable substrate. A candidate bioactive agent is optionally added. The cells are washed
and then viewed in the microscope or flow cytometer.

Preferred granule enzymes include but are not limited to chymase, tryptase, arylsulfatase A,
beta-hexosaminidase, beta-glucuronidase, and beta-D-galactosidase. Substrates include ELF-97
glucuronide, N-acetyl beta-D glucuronide, ELF-97 coupled to peptides, etc., many of which are
commercially available, i.e., from Molecular Probes, supra, particularly Chapter 10, more particularly
Section 2 of Chapter 10, and referenced "related chapters". 

By detectable substrate is meant that the substrate comprises a fluorescent molecule as further
described herein, or can be detected with a fluorescent molecule specific for the substrate or cleaved
substrate, i.e., a fluorescent antibody. In a preferred embodiment, the substrate comprises a
detectable molecule formed of two fluorescent proteins, i.e., blue and green fluorescent protein (BFP
and p- or rGFP), and other similar molecules. As is known in the art, constructs of p- or rGFP and BFP
that hold these two proteins in close proximity allow fluorescence resonance energy transfer (FRET).
That is, the excitation spectrum of the p- or rGFP overlaps the emission spectrum of the BFP.
Accordingly, exciting the BFP results in p- or rGFP emission. If a protease cleavage site is engineered
between the p- or rGFP and BFP to form a "FRET construct", upon exposure of the FRET construct to
an active protease which cleaves the construct, the p- or rGFP and BFP molecules separate. Thus,
exciting the BFP results in BFP emission and loss of p- or rGFP emission.

Preferably, the protease dependent cleavage site inserted between two fluorescent proteins of the
FRET construct is specific for a granule specific enzyme. Thus, the FRET construct can be used for
detecting granule specific proteases specific for the cleavage site of the FRET construct. In this
embodiment, the protease substrate that is combined with the cells or media includes the FRET
construct. The FRET system allows for detection of the detectable molecule in its cleaved and
uncleaved state, and distinguishes between the two. The system is further described in Xu et al.,
Nucleic Acids Res. 26(8):2034 (1998); and Miyawaki et al., Nature 388(6645):882-887 (1997), both of
which are incorporated by reference.

The amount of substrate added to the cells or media will depend in part on the enzyme's specific
activity and the substrate itself, but generally is about 250 nM to about 1 mM, from about 1 µM to
about 100 µM being preferred, and from about 1 µM to about 10 µM being particularly preferred. In
general, changes of at least about 5% from baseline are preferred, with at least about 25% being
more preferred, at least about 100% being particularly preferred and at least about 1000% being especially
preferred. Baseline in this case means the amount of substrate cleavage prior to induction of 
exocytosis. 

In a preferred embodiment, changes in the quantity of granule specific proteins are determined.
Secretory granules contain proteins which are specifically targeted to the granule compartment due to
specific properties of these proteins. Upon exocytic induction, the granule specific proteins are
exposed to the surface and detected. 

In a preferred embodiment, detectable granule specific proteins are combined with a population of 
cells and subjected to conditions known to induce exocytosis. Optionally, a bioactive candidate is
combined with the cell population and detectable granule specific protein and the granule specific
protein is detected. Granule specific proteins include but are not limited to VAMP and synaptotagmin.
Also included within the definition of granule specific proteins are the mediators released during
exocytosis, including, but not limited to, serotonin, histamine, heparin, hormones, etc.

The quantification of the granule proteins may be done in several ways. In one embodiment, labelled 
antibodies (such as fluorescent antibodies) to granule specific proteins are used. In another
embodiment, the cells are engineered to contain fusion proteins comprising a granule protein and a
detectable molecule. In a preferred embodiment, a detectable molecule is added to the cells for
detection. For example, either directly or indirectly labelled antibodies can be used. A preferred
embodiment uses a first labelled antibody, with fluorescent labels preferred. Another embodiment 
uses a first and second label, for example, a labelled secondary antibody. Generally, this embodiment 
may use any agent that will specifically bind to the granule protein or compound that can be either 
directly or indirectly labelled. 

In a preferred embodiment the labels are engineered into the cells. For example, recombinant 
proteins are introduced to the cell population which are fusion proteins of a granule specific protein
and a detectable molecule. This is generally done by transforming the cells with a fusion nucleic acid
encoding a fusion protein comprising a granule specific protein and a detectable molecule. This is
generally done as is known in the art, and will depend on the cell type. Generally, for mammalian
cells, retroviral vectors and methods are preferred.

The fusion proteins are constructed by methods known in the art. For example, the nucleic acid
encoding the granule specific protein is ligated with a nucleic acid encoding a detectable molecule. By
detectable molecule herein is meant a molecule that allows a cell or compound comprising the
detectable molecule to be distinguished from one that does not contain it, i.e., an epitope, sometimes
called an antigen TAG, or a fluorescent molecule. Preferred fluorescent molecules include but are not
limited to p- or rGFP, BFP, YFP, and enzymes including luciferase and β-galactosidase. These constructs
can be made in such a way so that upon exocytosis an epitope, internal to the granule, is exposed at
the cell surface and can then be detected. The epitope is preferably any detectable peptide which is
not generally found on the cytoplasmic membrane, although in some instances, if the epitope is one
normally found on the cells, increases may be detected, although this is generally not preferred.

In a preferred embodiment, the cell population containing the fusion protein or detectable granule
specific protein is subjected to exocytic conditions. Optionally, a candidate bioactive agent and/or
exocytic inhibitor is included. Preferably, the cells are washed. Fluorescence is detected on the cells.
In general, changes of at least about 5% from baseline are preferred, with at least about 25% being
more preferred, at least about 50% being particularly preferred and at least about 100% being
especially preferred. Generally, baseline in this case means amount of fluorescence prior to exocytic 
stimulus. 

In the invention herein, the same characteristic of exocytosis is evaluated by multiple parameters 
which results in reduced background and greater specificity. In contrast, FACS has been used in the
past to evaluate two different or unrelated characteristics at the same time which identifies cells having
those two characteristics, but does not reduce the background for the combined characteristics. The
present invention can, however, in addition to the identification of multiple exocytosis properties, be
combined with the identification of other cellular parameters, as outlined above. 
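
The background reduction obtained by scoring one characteristic with several indicators can be sketched as a simple gating rule. This is an illustrative sketch only: the indicator names, cutoffs and per-cell readings below are hypothetical, and an actual sort would rely on gates set on the instrument.

```python
from typing import Dict, List


def passes_all_gates(readings: Dict[str, float], gates: Dict[str, float]) -> bool:
    """True only if every indicator named in `gates` meets its cutoff."""
    return all(readings.get(name, 0.0) >= cutoff for name, cutoff in gates.items())


def select_cells(cells: Dict[str, Dict[str, float]], gates: Dict[str, float]) -> List[str]:
    """Return the identifiers of cells that pass every gate."""
    return [cell_id for cell_id, readings in cells.items()
            if passes_all_gates(readings, gates)]


# Hypothetical percent changes from baseline for two exocytosis indicators.
gates = {"annexin_binding": 25.0, "granule_enzyme": 5.0}
cells = {
    "cell_1": {"annexin_binding": 180.0, "granule_enzyme": 60.0},  # both respond
    "cell_2": {"annexin_binding": 150.0, "granule_enzyme": 1.0},   # annexin only
    "cell_3": {"annexin_binding": 3.0, "granule_enzyme": 2.0},     # no response
}
print(select_cells(cells, gates))
```

In this sketch a cell that scores on only one indicator, for example a cell that binds annexin for reasons unrelated to exocytosis, is excluded, which is the sense in which multiple parameters for the same characteristic reduce background.
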

In a preferred embodiment, the cells are subjected to conditions that normally cause exocytosis.
Pro-exocytic agents include ionomycin, Ca++ ionophores (ionomycin, A23187), compound 48/80,
substance P, complement C3a/C5a, trypsin, tryptase, insulin, interleukin-3, specific IgE, allergen,
anti-IgE, or anti-IgG receptor antibodies. These are provided at concentrations depending on the
compound as is known in the art, ranging from 1 picomolar to 10 µM, generally. In some cases, it
may be desirable to combine the cells with agents which inhibit exocytosis. Exocytosis inhibitors
include but are not limited to Wortmannin and Genistein, and others known in the art.

In a preferred embodiment, the methods are used to screen candidate bioactive agents for the ability 
to modulate exocytosis. The candidate bioactive agents may be combined with the cell population 
before, during or after exocytosis is stimulated, preferably before. In some instances, it may be
desirable to determine the effect of the candidate bioactive agent, also referred to as "candidate
agents" herein, on the cell wherein exocytosis is not induced or wherein exocytosis is inhibited. The 
candidate bioactive agent can be added to the cell population exogenously or can be introduced into 
the cells as described further herein. 

In a preferred embodiment, as above for cell cycle assays, a library of different candidate bioactive
agents is used.

As above, the candidate bioactive agents are combined or added to a cell or population of cells; again, 
as outlined above, preferred embodiments utilize nucleic acid candidate agents and fusion partners;
and preferably retroviral constructs. 

Wherein the candidate agents are nucleic acids, methods known in the art such as calcium
phosphate, electroporation, and injection may be used to introduce these to the cells. The exocytic
stimulus is generally combined with the cells under physiological conditions. Incubations may be
performed at any temperature which facilitates optimal activity, typically between 4 and 40°C.
Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid
high-throughput screening.

As above, a variety of other reagents may be included in the assays, and the cells are sorted as 
above. The sorting results in a population of cells having the desired exocytic properties. In a 
preferred embodiment, the parameters are set to identify at least one candidate bioactive agent that
modulates exocytosis. 

In a preferred embodiment, the bioactive agent is characterized. This will proceed as will be 
appreciated by those in the art, and generally includes an analysis of the structure, identity, binding
affinity and function of the agent. Generally, once identified, the bioactive agent is resynthesized and
combined with the target cell to verify the exocytosis modulation under various conditions and in the
presence or absence of other various agents. The bioactive agent can be prepared in a therapeutically
effective amount to modulate exocytosis and combined with a suitable pharmaceutical carrier. 

In a preferred embodiment, the cell populations can be subjected to various experimental conditions, 
with and without the candidate agents, and with and without exocytic stimulation or inhibition. 
Changes in conditions include but are not limited to changes in pH, temperature, buffer or salt 
concentration, etc. In a preferred embodiment, the pH is changed, generally by increasing or 
decreasing the pH, usually by from about 0.5 to about 3 pH units. Alternatively, the temperature is
altered, with increases or decreases of from about 5°C to about 30°C being preferred. Similarly, the
salt concentration may be modified, with increases or decreases of from about 0.1 M to about 2 M
being preferred. 

In a preferred embodiment, the cellular phenotype to be modulated is small molecule (or other 
candidate agent) toxicity. These are generally as outlined above for cell viability assays. Small 
molecule dose responses can also be compared by comparing the cells with the greatest functional
response, and then backgating to see if there is more or less toxicity associated with those cells. 

In a preferred embodiment, the cellular phenotype involves the expression or activity of cell surface 
receptors; up to sixteen cell surface markers may be followed simultaneously, with up to eight being
preferred. The presence or absence of any particular cell surface marker can be detected by directly
and indirectly conjugated antibodies against any cell surface protein whose cell surface expression
reflects an important functional parameter associated with the cells being studied. The effect of 
candidate agents such as small molecules can then be tested against individual or multiple markers. 

In a preferred embodiment, the cellular phenotype involves the expression or activity of enzymes such
as fluorescent based reporter systems that can report a biological event that occurs simultaneously
with the primary measurement or is a result of the primary measurement. This reporter system can be
a readout of upstream signal transduction pathways that are active in the cytoplasm, or of nuclear 
transcriptional or translational events, as well as export events from the nucleus or the cell. 

In a preferred embodiment, the cellular phenotype involves protein-protein interactions (or interactions 
between other binding ligands), such as dimerization, that can be either disrupted or instigated by a
candidate agent. These events may be measured by the appearance or disappearance of FRET
between two labeled binding ligands. 

All references cited herein are incorporated by reference. 

EXAMPLES 

Vector Construction 

Retroviral constructs were based on a pCGFP vector that carries a composite CMV promoter fused to
the transcriptional start site of the MMLV R-U5 region of the LTR, an extended packaging sequence,
deletion of the MMLV gag start ATG, and a multiple cloning region encoding human codon-optimized
EGFP (Clontech, Palo Alto, CA) and a Kozak consensus start, described in Kozak, Cell 44:283-292.
The vector used to express Flag tagged EGFP, pEf, is identical to pCGFP but has additional restriction
sites in the open reading frame of EGFP (resulting in 8 non-human optimized codons) and a Flag tag
fused to the C-terminus of EGFP with the linker EEAAKA. 

pR and pP are retroviral expression vectors containing human codon-optimized Renilla muelleri and
Ptilosarcus gurneyi GFPs (containing 9 and 11 non-optimized codons, respectively, to introduce
restriction sites). Each has a Kozak consensus start and backbone vector sequence identical to that
of pCGFP and pEf. These vectors were made by annealing and ligating 20 synthetic oligonucleotides
(10 forward, 10 reverse for each GFP gene) creating a dsDNA fragment for each sequence shown in
Table 1. These fragments were PCR amplified with respective primers:
R forward, 5'-
GATCATAGAATTCGCCACCATGGGCAGCAAGCAGATCCTGAAGAACACCTGCCTG;
P forward, 5'-
GATCATAGAATTCGCCACCATGGGCAACCGCAACGTGCTGAAGAACACCGGCCTG;
R and P reverse, 5'-
ATGATCGCGGCCGCTACACCCACTCGTGCAGGGATCCCAGGGGCTTGCCGATG;
and cloned into the EcoRI/NotI restriction sites of pEf (replacing the Ef coding region). C-terminal Flag
tags were added to these GFPs through BamHI/NotI sites using annealed primers with sticky
overhangs:
Forward, 5'-
GATCCCTGCACGAGTGGGTGGAGGAGGCCGCCAAGGCCGACTACAAGGACGACGACGACAAGTAGGCCCGTGAGGCCCTAAGC;
Reverse, 5'-
GGCCGCTTAGGGCCTCACGGGCCTACTTGTCGTCGTCCTTGTAGTCGGCCTTGGCGGCCTCCTCCACCCACTCGTGCAGG;
creating Rf and Pf. pRcDNA was made by removing the R. muelleri cDNA gene from pET-34 Native
Renilla muelleri GFP (Prolume Ltd., Pittsburgh, PA) by PCR amplification with primers:
Forward, 5'-
GATCATGAATTCGCCATGAGTAAACAAATATTGAAGAACACT;
Reverse, 5'-
TAGATCGCGGCCGCTTAAACCCATTCGTGTAAGGATCCTAGTGG;
and cloning into the EcoRI/NotI sites of pEf. Vectors containing codon optimized R. muelleri GFP with
a linker-HA tag-linker sequence inserted into each position A-F were created by the PCR sew
technique of two fragments using primers shown above (R forward and R reverse). The two
fragments for A-F were made by PCR amplification of the 5' section of R with respective primers:
R forward, shown above;
A reverse, 5'-
GTGGGGTAGTGGGGGACGTCGTAGGGGTAGGCAGGGGGCTGGGGGTCGTAGGGGAGGGTGGGCTCGTAC;
B reverse, 5'-
CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCCTCGATCAGGTTGATGTCGCTGCGG;
C reverse, 5'-
CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGTTCATGTACATGGCCTCGAAGCTG;
D reverse, 5'-
CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGTTAAGCTTGTACACAGGATCACC;
E reverse, 5'-
CTGGCGTAGTCGGGACGTCGTAGGGGTAGCCACCGAAATGGAAGAAATTGCTCTTCATCAGGGTCTTC;
F reverse, 5'-
CTGGCGTAGTCGGGCACGTCGTAGGGGTAGCCACCGCCCTGGCCGCCGCCGTCCTCCACGTAGGTCTTC;
and the 3' section of R with respective primers:
A forward, 5'-
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAGCAGGTGGAGGCGACGGCGGCCTGGTGGAGATCCGCA;
B forward, 5'-
CCTACGACGTGCCCGACTAGCCAGCCTGGGCCAAGCAGGTGGAGGCGACAAGTTCGTGTACCGCGTGGAGT;
C forward, 5'-
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCAACGGCGTGCTGGTGGGCGAGGTGA;
D forward, 5'-
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCAGCGGGAAGTACTACAGCTGCCACA;
E forward, 5'-
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGCAGGTGGAGGCGTGGTGAAGGAGTTCCCCAGCTACC;
F forward, 5'-
CCTACGACGTGCCCGACTACGCCAGCCTGGGCCAAGGAGGTGGAGGCTTCGTGGAGCAGCACGAGACCGCCA.
The PCR sewed fragments were put into the EcoRI/NotI sites of pEf.

The bacterial expression vector for purification of Ptilosarcus GFP was created by PCR amplification of
pP with primers:
forward, 5'-
AGATCATAGATCTATGGGCAACCGCAACGTGCTGAAGAACACCGGCCTG;
P reverse, shown above.
The fragment was digested with BglII/NotI and ligated into the BamHI/NotI restriction sites of pGEX6P-1
(Pharmacia Biotech, Piscataway, New Jersey). The vector containing R. muelleri GFP with C10G and
C35E mutations (observed to aid in the folding of the protein in bacteria) was created by PCR sewing
together a fragment created by annealing and extending primers:
forward, 5'-
AGATCATAGATCTGAATTCATGGGCAGCAAGCAGATCCTGAAGAACACCGGCCTGCAGGAGGTGATGAGCTACAAGGTGACCTGGAGG;
reverse, 5'-
GCCAACAGGATGTTGCCGTTGCGCTCGCCCTCCATGGTGAACACGTGGTTGTTAACGATGCCCTCCAGGTTCACCTTGTAGCTCATCAC;
R reverse, shown above.
The sewed product was digested with BglII/NotI and ligated into the BamHI/NotI sites of pGEX6P-1.
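
As a quick check on the Flag-tag oligonucleotide design described above, the forward annealed primer can be translated in silico. The sketch below is illustrative only: it assumes the standard genetic code, takes the forward primer with every position read as a standard base, and assumes the coding frame begins after the five-nucleotide GATCC overhang. The name `translate` and the frame offset are assumptions made for this example, not part of the disclosed method.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in the conventional TCAG codon order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    peptide = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


# Forward annealed Flag-tag primer from the text, written 5' to 3'.
FLAG_FORWARD = (
    "GATCCCTGCACGAGTGGGTGGAGGAGGCCGCCAAGGCCGACTACAAGGACGACGACGACAAG"
    "TAGGCCCGTGAGGCCCTAAGC"
)

# Assumption: the reading frame starts after the 5-nucleotide GATCC overhang.
print(translate(FLAG_FORWARD[5:]))
```

Under these assumptions the translation ends with the EEAAKA linker followed by the Flag epitope DYKDDDDK and a stop codon.
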
Cells and Retrovirus Transduction 

Phoenix E retroviral packaging cells, described in Swift et al., Current Protocols in Immunology (1999)
10.17C:1-17, were carried in 10% fetal bovine serum with 1% penicillin-streptomycin (JRH
Bioscience, Williamsburg, VA) and Dulbecco's modified Eagle media (Mediatech Cellgro, Herndon,
VA). Jurkat cells stably expressing the ecotropic receptor (Jurkat E) were carried in 10% fetal calf
serum with 1% penicillin-streptomycin in RPMI 1640 media (JRH Bioscience, Williamsburg, VA).
Calcium phosphate transfection of Phoenix E cells and infection of Jurkat E cells were carried out
as described in Swift et al.

Gel Filtration 

Gel filtration was carried out on a 1 x 30 cm Pharmacia Superdex 75 column, equilibrated in 
phosphate buffered saline and eluted at 0.3 ml/min at 22°C. The column was on a Hewlett-Packard
1100 HPLC system equipped with a standard fluorescence detector with an 8 µl flow cell. GFP peaks
were detected by absorption at 489 nm or by fluorescence emission at 512 nm. Fluorescence
excitation spectra were recorded with a fixed emission wavelength at 549 nm, and emission spectra
were recorded at a fixed excitation wavelength of 450 nm.

FACS and Microscopy 

Flow-cytometry analysis and cell sorting of GFP expressing cells were performed on a FACScan 
(Becton-Dickinson, San Jose, CA) or MoFlo (Cytomation, Fort Collins, CO) instrument, and data were
analyzed using FlowJo software (Tree Star Software, San Carlos, CA). Live cells were gated on by
scatter and propidium iodide staining during data analysis. GFP fluorescence intensity measurements
(geometric mean) were of GFP positive cells only. Cells expressing GFP were visualized using a Nikon
Eclipse TE300 fluorescence microscope.
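
The geometric mean of GFP positive cells referred to above can be illustrated with a short calculation. This is a minimal sketch and not the analysis software's implementation: the function names, the gate value and the event intensities are hypothetical.

```python
import math
from typing import List


def geometric_mean(values: List[float]) -> float:
    """Geometric mean computed as the exponential of the mean log value."""
    if not values:
        raise ValueError("no events to average")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def gfp_positive_geomean(intensities: List[float], gate: float) -> float:
    """Geometric mean intensity of the events brighter than the GFP gate."""
    positive = [v for v in intensities if v > gate]
    return geometric_mean(positive)


# Hypothetical per-cell fluorescence intensities (arbitrary units) and gate.
events = [1.2, 2.5, 10.0, 100.0, 1000.0]
print(round(gfp_positive_geomean(events, gate=5.0), 2))
```

Because flow cytometry intensities are typically spread over several decades, the geometric mean is less dominated by the brightest events than the arithmetic mean would be.
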

Western Analysis

For preparation of whole-cell lysates, identical numbers of cells were collected, washed in PBS and
prepared in lysis buffer (50 mM Hepes pH 7.4, 150 mM NaCl, 5 mM EDTA, 5 mM EGTA, 1% Triton X-100)
with added Complete EDTA-free protease inhibitor cocktail (Boehringer Mannheim, Chicago, IL).
Lysate cleared by centrifugation was resolved on 4-12% NuPage SDS polyacrylamide gels (Novex,
San Diego, CA) as per the manufacturer's recommendations. Samples transferred to PVDF
membranes were blotted using 10% milk, 0.1% Tween 20 in 1X PBS blocking buffer with rabbit
polyclonal flag-probe (Santa Cruz Biotechnology, Santa Cruz, CA) at a 1:2000 dilution and goat
anti-rabbit IgG-horseradish peroxidase conjugate (Sigma, St. Louis, MO) secondary at a 1:5000 dilution.
Membranes were detected using ECL plus enhanced chemiluminescence kit (Amersham Pharmacia,
Piscataway, NJ) and Hyperfilm ECL film (Amersham Life Sciences, Buckinghamshire, UK). Exposed
film was scanned with a Hewlett Packard (Palo Alto, CA) ScanJet 4C scanner and band intensities
were integrated using the program NIH Image (see http://rsb.info.nih.gov/nih-image/about.html).

GFP Purification from E. coli 

All components used for purification of the GFP gene products were from Pharmacia Biotech
(Piscataway, NJ) except as noted. The human codon-optimized gene for each protein was expressed
in BL21 TIL codon plus (DE3) E. coli (Stratagene, San Diego, CA) as a fusion protein with glutathione
S-transferase from pGEX6P-1 derived vectors. Each protein was purified using Glutathione
Sepharose 4B beads as per the manufacturer's directions, and the mature GFP was removed from
the protein with PreScission Protease. The purified proteins ran as single bands by SDS-PAGE and
appeared as single peaks of the expected molecular mass by MALDI-TOF mass spectrometry on a
Bruker Reflex III instrument (Bruker Daltonics, Billerica, MA). Due to the cloning strategy, purified R.
muelleri GFP has the amino acids PLGSEF- and Ptilosarcus GFP the residues GPLGS- fused to their
N-termini. Purified recombinant EGFP was from Clontech (Palo Alto, CA).

CD Studies 

CD spectra were recorded as described in Gururaja et al., Chem. Biol. (2000) in press. Spectra were
recorded between 200 and 250 nm at 0.2 nm intervals with a time constant of 1 s. Data were collected
from five separate scans and averaged. The protein concentrations were in the range of 5 to 10 µM,
as determined by the Lowry method, described in Lowry et al., J. Biol. Chem. (1951) 193:265-275.
Protein solutions were made in 10 mM phosphate buffer containing 100 mM KF at pH 7.5, and were
diluted in the same buffer to yield the appropriate final concentration. The thermal denaturation was
measured at 218 nm over a range of 4-98°C with a temperature step of 2°C, a 2 minute equilibration
time, and a 60 s signal averaging time. The apparent Tm was determined by fitting the data to a
logistic sigmoid equation using the Levenberg-Marquardt algorithm in Ultrafit (Biosoft, Cambridge,
UK). In addition, the apparent Tm was determined as the maximum of the first derivative of the CD
signal with respect to temperature. Both methods of Tm calculation agreed well. CD spectra were
deconvoluted with the program CDNN (CD neural network) downloaded from
http://bioinformatik.biochemtech.uni-halle.de/cdnn/index.html.
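
The first-derivative estimate of the apparent Tm described above can be sketched as follows. This is an illustrative sketch only: the melting curve is synthetic, generated from a logistic function with hypothetical parameters, and the derivative is approximated by central differences. The logistic fit by the Levenberg-Marquardt algorithm in Ultrafit is not reproduced here.

```python
import math
from typing import List


def logistic_cd(temp_c: float, folded: float, unfolded: float,
                tm: float, width: float) -> float:
    """Synthetic two-state CD signal following a logistic sigmoid in temperature."""
    return folded + (unfolded - folded) / (1.0 + math.exp((tm - temp_c) / width))


def tm_from_first_derivative(temps: List[float], signal: List[float]) -> float:
    """Temperature at which the central-difference slope has the largest magnitude."""
    if len(temps) < 3 or len(temps) != len(signal):
        raise ValueError("need at least three matched points")
    best_temp, best_slope = temps[1], -1.0
    for i in range(1, len(temps) - 1):
        slope = abs((signal[i + 1] - signal[i - 1]) / (temps[i + 1] - temps[i - 1]))
        if slope > best_slope:
            best_temp, best_slope = temps[i], slope
    return best_temp


# Temperatures from 4 to 98 degrees C in 2 degree steps, as in the text.
temps = [float(t) for t in range(4, 99, 2)]
# Hypothetical curve parameters; real data would come from the 218 nm readings.
signal = [logistic_cd(t, folded=-20.0, unfolded=-5.0, tm=70.0, width=3.0) for t in temps]
print(tm_from_first_derivative(temps, signal))
```

With a 2 degree temperature step, this estimate is resolved only to the nearest sampled temperature, which is one reason a sigmoid fit over the whole curve is a useful cross-check.
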
CLAIMS

We claim: 

1. A retroviral vector comprising a p- or rGFP gene. 

2. A retroviral vector comprising a first gene, an IRES site, and a p- or rGFP gene.

3. A cell comprising a retroviral vector according to claim 1 or 2.

4. A library of fusion nucleic acids, each fusion nucleic acid comprising: 

a) a gene encoding a random peptide; and 

b) a gene encoding a p- or rGFP. 

5. A library according to claim 4 wherein said fusion nucleic acid further comprises a fusion partner. 

6. A library of cells comprising a library of fusion nucleic acids according to claim 4 or 5.

7. A library of retroviral vectors comprising a library of fusion nucleic acids, each fusion nucleic acid 
comprising: 

a) a gene encoding a random peptide; and 

b) a gene encoding a p- or rGFP. 

8. A library of cells comprising a library of retroviral vectors according to claim 7.

9. A library of cells according to claim 6 or 8 wherein said cells are mammalian.

10. A method of screening for bioactive agents capable of inhibiting an IL-4 inducible ε promoter, said
method comprising:

a) combining a candidate bioactive agent and a cell comprising a fusion nucleic acid
comprising:

i) an IL-4 inducible ε promoter; and

ii) a Renilla green fluorescent protein (p- or rGFP); 

b) inducing said promoter with IL-4; and 

c) detecting the presence or absence of said p- or rGFP; 

wherein the absence of said p- or rGFP indicates that said agent inhibits said IL-4 inducible ε
promoter.

11. A cell line for screening selected from the group consisting of CA-46 and MC-116, said cell line
comprising a fusion nucleic acid comprising:

a) an IL-4 Inducible c promoter; and 

b) a p- or rGFP. 

12. A method of screening for bioactive agents capable of modulating IgE production, said method 
comprising: 

a) combining a candidate bioactive agent and a cell comprising nucleic acid encoding an IgE 
fusion protein comprising: 

i) the ε heavy chain; and 

ii) a p- or rGFP; 

b) determining the amount of IgE produced in said cell; 

wherein a change in the amount of IgE as compared to the amount produced in the absence of said 
candidate agent indicates that said agent modulates IgE production. 

13. A method of screening for bioactive agents capable of modulating the activity of a promoter of 
interest, said method comprising: 

a) combining a candidate bioactive agent and a cell comprising a fusion nucleic acid 
comprising: 

i) a promoter of interest; and 

ii) nucleic acid encoding a p- or rGFP protein; 

b) optionally inducing said promoter; 

c) detecting the presence of said p- or rGFP protein. 
Homology Comparison of Ptilosarcus gurneyi and Renilla mulleri GFPs with Aequorea victoria wt and Enhanced GFPs 

Ptilosarcus      MNRNVLKNTGLKEIMSAKASVEGIVNNHVFSMEGFGKGNVLFGNQLMQIR
Renilla          MSKQILKNTCLQEVMSYKVNLEGIVNNHVFTMEGCGKGNILFGNQLVQIR
EGFP (Aequorea)  MSKGEELFTGVVPIL---VELDGDVNGHKFSVSGEGEGDATYGKLTLKFI
Aequorea         MSKGEELFTGVVPIL---VELDGDVNGHKFSVSGEGEGDATYGKLTLKFI
Prim. consensus  MSKGE222TG2V2I2S2KVEL2G2VN2H2FS22GEG2G2A22G222L222

Ptilosarcus      VTKGGPLPFAFDIVSIAFQYGNRTFTKYPDDIA--DYFVQSFPAGFFYER
Renilla          VTKGAPLPFAFDIVSPAFQYGNRTFTKYPNDIS--DYFIQSFPAGFMYER
EGFP             CTTG-KLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQER
Aequorea         CTTG-KLPVPWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQER
Prim. cons.      2T2G22LP2222222T2FQYG222F22YPD22KQHD2FK222P2G2V2ER
                                   Chromophore

Ptilosarcus      NLRFEDGAIVDIRSDISLEDDKFHYKVEYRGNGFPSNGPVMQKAILGMEP
Renilla          TLRYEDGGLVEIRSDINLIEDKFVYRVEYKGSNFPDDGPVMQKTILGIEP
EGFP             TIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYN
Aequorea         TIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYN
Prim. cons.      T22F2D2GN2K2R222K2EGD22V2R2E2KGIDF2EDG22222K222N22

Ptilosarcus      SFEVVYMN----SGVLVGEVDLVYKLESGNYYSCHMKTFYRSKGGVKEFP
Renilla          SFEAMYMN----NGVLVGEVILVYKLNSGKYYSCHMKTLMKSKGVVKEFP
EGFP             SHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLP
Aequorea         SHNVYIMADRQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLP
Prim. cons.      S22VY2M2DRQKNG22V222I22222D22V2222H222NTP222G2222P

Ptilosarcus      EYHFIHHRLEKTYVEEGSFVEQHETAIAQLTTIGKPLGSLHEWV
Renilla          SYHFIQHRLEKTYVEDGGFVEQHETAIAQMTSIGKPLGSLHEWV
EGFP             DNHYLS--TQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK
Aequorea         DNHYLS--TQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK
Prim. cons.      D2H22SHR2222222D2N2222H222222VTA2G22LG222222



Alignment data: 

Ptilosarcus G. vs. Aequorea 

Identity (*) : 55 is 22.45 % 
Strongly similar (:) : 61 is 24.90 % 
Weakly similar (.) : 35 is 14.29 % 
Different : 94 is 38.37 % 

Renilla M. vs. Aequorea 
Identity (*) : 60 is 24.39 % 
Strongly similar (:) : 65 is 26.42 % 
Weakly similar (.) : 28 is 11.38 % 
Different : 93 is 37.80 % 

Renilla vs. Ptilosarcus 
Alignment length : 238 
Identity (*) : 184 is 77.31 % 
Strongly similar (:) : 31 is 13.03 % 
Weakly similar (.) : 14 is 5.88 % 
Different : 9 is 3.78 % 

Beau Peelle 10/15/99 
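
The percentages in the alignment data can be checked from the category counts alone. The sketch below is illustrative and not part of the original figure: it assumes each percentage is the category count divided by the alignment length, and that the alignment length equals the sum of the four categories (which holds for the stated Renilla vs. Ptilosarcus length of 238). The function name `similarity_percentages` is hypothetical.

```python
def similarity_percentages(counts):
    """Return (alignment_length, {category: percent}) for one pairwise comparison.

    Assumption: the alignment length is the sum of the four category counts,
    and each percentage is count / length, rounded to two decimals.
    """
    total = sum(counts.values())
    percents = {name: round(100.0 * n / total, 2) for name, n in counts.items()}
    return total, percents


# Counts as reported in the alignment data for Renilla vs. Ptilosarcus.
renilla_vs_ptilosarcus = {
    "identity": 184,
    "strongly similar": 31,
    "weakly similar": 14,
    "different": 9,
}

length, percents = similarity_percentages(renilla_vs_ptilosarcus)
print(length)
for name, value in percents.items():
    print(f"{name}: {value:.2f} %")
```

Applying the same function to the Ptilosarcus vs. Aequorea counts (55, 61, 35, 94) gives a length of 245, and to the Renilla vs. Aequorea counts (60, 65, 28, 93) a length of 246. Those lengths are inferred from the counts; the figure does not state them.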
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:XS: 

GGTTATACAC AAGTOTATCG CGTATCTGCA GACGCATCTA GTGGOATTAT TCX3AGCX»3TA 60 

GTATITACGT CAGACCTQTC TAATOGAAAC CACAACAAAC TCTTAAAATA AGCCACATTT 120 

ACATA ATATC TAAGAGAOOC CTCAZTIAAO AOTAOTAAAA ATATAATATA TQATAQAGTA 180 

TA CAAC TCTC GCCXTAGAGA GACAGTGTGC AACAC5AGTAA CTCTTGTTAA TGCAATCGAA 240 

A6CGTCAAGA OAQATAAG ATO AGT AAA CAA ATA TTQ AAO AAC ACT TGST TTA 291 

Met Ser Lys Gln Ile Leu Lys Asn Thr Cys Leu 
1 5 10 

CAA GAA GTA ATO TCG TAT AAA GTA AAT CTO GAA GQA ATT GTA AAC AAC 339 
Gln Glu Val Met Ser Tyr Lys Val Asn Leu Glu Gly Ile Val Asn Asn 
15 20 25 

CAT GIT TTT ACA ATG GAG GGT TGC GGC AAA GGG AAT ATT TTA TTC GGC 387 
His Val Phe Thr Met Glu Gly Cys Gly Lys Gly Asn Ile Leu Phe Gly 
30 35 40 

AAT CAA CTG GTT CAG ATT CGT GTC ACQ AAA GGG GCC CCA CTG CCT TIT 435 
Asn Gln Leu Val Gln Ile Arg Val Thr Lys Gly Ala Pro Leu Pro Phe 
45 50 55 

OCA TTT GAT ATT GTG TCA CCA GCT TTT CAA TAT GGC AAC CGT ACT TTC 483 
Ala Phe Asp Ile Val Ser Pro Ala Phe Gln Tyr Gly Asn Arg Thr Phe 
60 65 70 75 

AOG AAA TAT CCG AAT GAT AXA TCA GAT TAT TTT ATA CAA TCA TTT CCA 531 
Thr Lys Tyr Pro Asn Asp Ile Ser Asp Tyr Phe Ile Gln Ser Phe Pro 
80 85 90 

GCA GGA TTT ATG TAT GAA CGA ACA TTA CGT TAC GAA GAT GGC GGA CTT 579 
Ala Gly Phe Met Tyr Glu Arg Thr Leu Arg Tyr Glu Asp Gly Gly Leu 
95 100 105 

GTT GAA ATT CGT TCA GAT ATA AAT TTA ATA GAA GAC AAG TTC GTC TAC 627 
Val Glu Ile Arg Ser Asp Ile Asn Leu Ile Glu Asp Lys Phe Val Tyr 
110 115 120 

AGA GTG GAA TAC AAA GGT AGT AAC TTC CCA OAT GAT GOT CCC OTC ATO 675 
Arg Val Glu Tyr Lys Gly Ser Asn Phe Pro Asp Asp Gly Pro Val Met 
125 130 135 

CM AAG ACT ATC TTA GGA ATA GAG CCT TCA TOT GAA GCX: 723 
Gln Lys Thr Ile Leu Gly Ile Glu Pro Ser Phe Glu Ala Met Tyr Met 
140 145 150 155 

AAT AAT GGC GTC TTO GTC GGC GAA GTA ATT CTT GTC TAT AAA CTA AAC 771 
Asn Asn Gly Val Leu Val Gly Glu Val Ile Leu Val Tyr Lys Leu Asn 
160 165 170 

TCT GGG AAA TAT TAT TCA TGT CAC ATG AAA ACA TTA ATG AAG TCG AAA 819 
Ser Gly Lys Tyr Tyr Ser Cys His Met Lys Thr Leu Met Lys Ser Lys 
175 180 185 

GGT GTA GTA AAG GAG TTT CCT TCG TAT CAT TTT ATT CAA (».T CG^ 
Gly Val Val Lys Glu Phe Pro Ser Tyr His Phe Ile Gln His Arg Leu 
190 195 200 

GAA AAG ACT TAC GTA GAA GAC GGG GGG TTC GTT GAA CAG CAT GAG ACT 915 
Glu Lys Thr Tyr Val Glu Asp Gly Gly Phe Val Glu Gln His Glu Thr 
205 210 215 

GCT ATT GCT CAA ATG ACA TCT ATA GGA AAA QCA CTA GGA TCC TTA CAC 963 
Ala Ile Ala Gln Met Thr Ser Ile Gly Lys Pro Leu Gly Ser Leu His 
220 225 230 235 

GAA TOO OTT TAA ACACAGTTAC ATTACTTTTT CCAATTCGTO TTTCATGTCA AATAAT 102X 
Glu Trp Val • 

AATTTTTTAA ACAATTATCA ATQTTTTGTG ATATOTTTOT AAAAAAAAAA AAAAAAAA 1079 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

TCGGCACGA6 CTGGCCICCA CACTTTAG^C AAA ATO AAC CX5C AAC GTA TTA AAG 54 

Met Asn Arg Asn Val Leu Lys 
1 5 

AAC ACT GGA CTG AAA GAG ATT ATG TCG GCA AAA GOT AGO GTT GAA GGA 102 
Asn Thr Gly Leu Lys Glu Ile Met Ser Ala Lys Ala Ser Val Glu Gly 
10 15 20 

ATC CTG AAC AAT CAC GTT TTT TCC ATG GAA GGA ITT GGA AAA GGC AAT 150 
Ile Val Asn Asn His Val Phe Ser Met Glu Gly Phe Gly Lys Gly Asn 
25 30 35 

GTA TTA TTT GGA AAC CAA TTG ATG CAA ATC COG GTT ACA AAG GGA GGT 198 
Val Leu Phe Gly Asn Gln Leu Met Gln Ile Arg Val Thr Lys Gly Gly 
40 45 50 55 

COO TTG CCA ITC GCT TTC GAT ATT GTT TCC ATA OCT TTC CAA TAG GGG 246 
Pro Leu Pro Phe Ala Phe Asp Ile Val Ser Ile Ala Phe Gln Tyr Gly 
60 65 70 

AAT OOC ACT TTC ACG AAA TAC CCA GAC GAC ATT 008 GAC TAC TTT OCT 294 
Asn Arg Thr Phe Thr Lys Tyr Pro Asp Asp Ile Ala Asp Tyr Phe Val 
75 80 85 

CAA TCA TTC CCG GCT GGA TTT TTC TAC GAA AGA AAT CTA COC TTT GAA 342 
Gln Ser Phe Pro Ala Gly Phe Phe Tyr Glu Arg Asn Leu Arg Phe Glu 
90 95 100 

^ 2?^ ^ °" COT TCA GAT ATA ACT TTA GAA GAT GAT 390 

Asp Gly Ala Ile Val Asp Ile Arg Ser Asp Ile Ser Leu Glu Asp Asp 
105 110 115 

MO TTC OVC TAC AAA GTO GAG TAT AGA GGC AAC OGT TTC OCT AGT AAC 438 
Lys Phe His Tyr Lys Val Glu Tyr Arg Gly Asn Gly Phe Pro Ser Asn 
120 125 130 135 

GGA CCC GTG ATG CAA AAA GCC ATC CTC GGC ATG GAG CCA TCG TTT GAG 486 
Gly Pro Val Met Gln Lys Ala Ile Leu Gly Met Glu Pro Ser Phe Glu 
140 145 150 

GTO GTC TAC ATO AAC AOC GGC GTT CTG GTO GGC GAA GTA GAT CTC GTT 534 
Val Val Tyr Met Asn Ser Gly Val Leu Val Gly Glu Val Asp Leu Val 
155 160 165 

TAC AAA CTC GAG TCA GGG AAC TAT TAC TOG TGC CAC ATG AAA ACG TTT 582 
Tyr Lys Leu Glu Ser Gly Asn Tyr Tyr Ser Cys His Met Lys Thr Phe 
170 175 180 

TAC AGA TCC AAA GGT GGA GTG AAA GAA TTC CCG GAA TAT CAC TTT ATC 630 
Tyr Arg Ser Lys Gly Gly Val Lys Glu Phe Pro Glu Tyr His Phe Ile 
185 190 195 
CAT CAT CGT Cro GAG ACC TAC <nO aW^ GWV 6(» JUTC TTC GTG 678 

His His Arg Leu Glu Lys Thr Tyr Val Glu Glu Gly Ser Phe Val Glu 
200 205 210 215 

CAA CAC GAG ACQ GCC ATT 6CA CAA CTO ACC ACA ATT GOA AAA CCT CTG 726 
Gln His Glu Thr Ala Ile Ala Gln Leu Thr Thr Ile Gly Lys Pro Leu 
220 225 230 

OGC TCC CTT CAT GAA TGG GTG TAG AAAATGACCA ATATACT6GG GAAACCGATA 780 
Gly Ser Leu His Glu Trp Val 
235 

ACCGTTTGGA AGCTTOIGTA TACAAATTAT TTGGGGTCAT TTTQTAATGT GTATGTGTGT 840 
TXTTATGATCA ATAGACGTCG TCATTCATAG CTTGAATCCT TCAGCAAAAG AAJ^CTCGAA 900 
GO^TTGAA ACCTCGAAOC ATATXGAAAC CTCOACGGAG AGCGTAAAGA GACCGCACAA 960 
X?^OTCGT TTOACCAGC AOTTOGAATC TTTAA ACCGA TCAAAACTAT T AATATAAAT 1020 
ATATAXATOC TC^A^ ATATAXATCT ATATAGITTO ATATiaATTA AATCTGTTCT 1080 
TGATCAAAAA AAAAAAAAAA AAAA 1104 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 



GACMUl ATO AAC CGC AAC GTA TTA AAG 27 
Met Asn Arg Asn Val Leu Lys 
1 5 

AACACTGGACiaAAAGAGATTATGTCOGCAAAAGCTAGCGCTQAAG^ 75 
Asn Thr Gly Leu Lys Glu Ile Met Ser Ala Lys Ala Ser Val Glu Gly 
10 15 20 

ATC GXG AAC AAT CAC GTT TTT TCC ATG GAA GGA TTT GGA AAA GGC AAT 123 
Ile Val Asn Asn His Val Phe Ser Met Glu Gly Phe Gly Lys Gly Asn 
25 30 35 

GTA TTA TTT GGA AAC CAA rrO ATO CAA ATC CGG GTT A« "1 
Val Leu Phe Gly Asn Gln Leu Met Gln Ile Arg Val Thr Lys Gly Gly 
40 45 50 55 

CCG TTO CCA TIC GCT TTC GAC ATT GTT TCC ATA GCT TTC CAA TAC GOG 219 
Pro Leu Pro Phe Ala Phe Asp Ile Val Ser Ile Ala Phe Gln Tyr Gly 
60 65 70 



AAT CGC ACT TTC ACG AAA TAC CCA GAC GAC ATT GCG GAC TAC ITT 6TT 267 
Asn Arg Thr Phe Thr Lys Tyr Pro Asp Asp Ile Ala Asp Tyr Phe Val 
75 80 85 

S?* IP ?P S?^ ^ TAC GAA AGA AAT CTA CGC TXT GAA 315 

Gln Ser Phe Pro Ala Gly Phe Phe Tyr Glu Arg Asn Leu Arg Phe Glu 
90 95 100 



S^I f?5 T?I S!? ?P '^TA AGT TTA GAA GAT GAT 363 

Asp Gly Ala Ile Val Asp Ile Arg Ser Asp Ile Ser Leu Glu Asp Asp 
105 110 115 

S^*^ T'^T AGA GGC AAC OGT TTC CCT AGT AAC 411 

Lys Phe His Tyr Lys Val Glu Tyr Arg Gly Asn Gly Phe Pro Ser Asn 
120 125 130 135 

2?^ S"' 5^ CTC GGC ATG GAG CCA TCG m GAG 4S9 

Gly Pro Val Met Gln Lys Ala Ile Leu Gly Met Glu Pro Ser Phe Glu 
140 145 150 

^ Si5 ii'? ^ °" GAA GTA GAT CTC GTT 507 

Val Val Tyr Met Asn Ser Gly Val Leu Val Gly Glu Val Asp Leu Val 
155 160 165 

TAC AAA CTC GAG TCA GG6 AAC TAT TAC TOG TGC CAC ATC AAA ACM TTT 555 
Tyr Lys Leu Glu Ser Gly Asn Tyr Tyr Ser Cys His Met Lys Thr Phe 
170 175 180 

»C AQA TCC AAA GOT GGA GTO AAA GAA TTC 603 

Tyr Arg Ser Lys Gly Gly Val Lys Glu Phe Pro Glu Tyr His Phe Ile 
185 190 195 

His His Arg Leu Glu Lys Thr Tyr Val Glu Glu Gly Ser Phe Val Glu 
200 205 210 215 

CAA CAC GAG ACG GCC ATT CCA CAA CTO ACC ACA ATT GGA AAA CCT CTQ 699 
Gln His Glu Thr Ala Ile Ala Gln Leu Thr Thr Ile Gly Lys Pro Leu 
220 225 230 

S IS SI ^ '^'^ ®^ 

AWWIACTGO GGAAAATOAC CAATTrACXO GSOAAAAIOA CCAATATACT GTAGAAAATC Bf» 
ACCAAIATAC TOGGGAAAAT GACCAATTTA CTOQO^AT GMOmS CTOTaaSSI? Ill 
TOCWMTAT ACTGTGGAAA ATOACCAAAA T^SrSSS^ MoSSoS toSctS^ III 

ACx my ^ ui A TAA CLii r rm gaaocttgtg tatacaaott jorxomc attttctwS III 

GlbmUiUT CTTCTATGAT CTATAGACGT OmaTTCAT ABCTIWaC CTTO^uS llll 
AGAAACCTCG AAGCATATTG AAACCXCGAC GGAGAGCATA ^GAOACCOC AC^^SS^ lill 
WTXATOATAC CAOCAGTTGG AATCTTTAAA CCGATCAjSa CTAtS^M ^ATaSc lln 
CCXGTAIAAC ATATATATAT ATATATATCT ACATAeTTTO AdStoATTA JUlTCTeTTrS llll 
TGMCACTAA A AA A AAAAAA AAAAAAAAAA AAAAAAAAAA MUuSa AATCIGTTCT 1233 



A retroviral vector comprising a Green Fluorescent Protein 
from Ptilosarcus (pGFP); a cell comprising said retroviral 
vector; a library of retroviral vectors comprising a library 
of fusion nucleic acids, each fusion nucleic acid comprising: 
a) a gene encoding a random peptide; and b) a gene 
encoding a pGFP; a library of cells comprising a library of 
said retroviral vectors. 
A retroviral vector comprising a Green Fluorescent Protein 
from Renilla (rGFP); a cell comprising said retroviral 
vector; a library of retroviral vectors comprising a library 
of fusion nucleic acids, each fusion nucleic acid comprising: 
a) a gene encoding a random peptide; and b) a gene 
encoding a rGFP; a library of cells comprising a library of 
said retroviral vectors. 
of retroviral vectors comprising such a library of fusion 
nucleic acids; a library of cells comprising such a library 
of said retroviral vectors; 
A library of fusion nucleic acids, each fusion nucleic acid 
comprising: a) a gene encoding a random peptide; and b) a 
gene encoding a Renilla rGFP; a library of cells comprising 
such a library of fusion nucleic acids; a library of 
retroviral vectors comprising such a library of fusion 
nucleic acids; a library of cells comprising such a library 
of said retroviral vectors; 
green fluorescent protein (p- or rGFP); cell line comprising 
said fusion peptide; a method of screening for bioactive 
agents capable of modulating IgE production using a cell 
modulating the activity of a promoter of interest using a 
cell comprising a fusion nucleic acid comprising a promoter 
of interest and a nucleic acid encoding a p- or rGFP; 